Novel class of proteins and uses thereof for plant resistance to various pathogenic agents

ABSTRACT

Disclosed is substantially pure DNA encoding an  Arabidopsis thaliana  Rps2 polypeptide; substantially pure Rps2 polypeptide; and methods of using such DNA to express the Rps2 polypeptide in plant cells and whole plants to provide, in transgenic plants, disease resistance to pathogens. Also disclosed are conserved regions characteristic of the RPS family and primers and probes for the identification and isolation of additional RPS disease-resistance genes.

[0001] This invention relates to a new class of proteins having an N-terminal portion containing characteristic units of a plant protein of resistance to pathogenic agents and a C-terminal portion containing a DNA-binding domain.

[0002] The invention also relates to nucleic acids coding for such proteins involved in plant resistance to various pathogens and also to means for detecting these proteins and these nucleic acids, such as antibodies or nucleotide probes and primers.

[0003] The invention also relates to recombinant vectors containing a nucleic acid coding for a polypeptide belonging to this new class of proteins, to host cells transformed by a nucleic acid or a recombinant vector according to the invention, and to transgenic plants of which some or all of the cells are transformed by a nucleic acid or a recombinant vector according to the invention.

[0004] The invention also relates to means for increasing or, conversely, inhibiting the expression of a nucleic acid coding for a protein according to the invention in plants, with the aim of increasing the resistance of said plants to various pathogens.

[0005] The invention further relates to methods for screening candidate substances fixing to a polypeptide according to the invention, and to candidate substances obtained according to such methods.

[0006] The improvement of the resistance of plants, and particularly of plants of agronomic interest, to different pathogens is the subject of much research, with the aim of satisfying the technical needs of an agricultural industry sector which more and more uses intensive culture methods.

[0007] The identification of new genes or new proteins able to provide to plants an increased resistance to different bacteria, fungi or viruses has assumed a major economic importance in today's agriculture, enabling the development of large-scale crops with natural resistance to these pathogens and thus not requiring substantial applications of agrochemical products, in particular pesticides, which have several adverse effects on the environment.

[0008] Bacterial wilting is one of the most widespread diseases affecting agricultural crops. It is one of the most important phytobacterioses in the world and is mainly caused by the bacterium Ralstonia solanacearum.

[0009] This vascular pathogen, of soil origin, affects more than 200 plant species including tomato, tobacco, potato or banana, mainly in tropical or sub-tropical zones.

[0010] Various strains of this pathogen have recently been detected in Europe. Study of the resistance of the tomato plant to Ralstonia solanacearum is complicated by its polygenic character. Several genes involved in the establishment of the resistance have been discovered.

[0011] A study by DESLANDES et al. (1998) has recently identified, in the plant Arabidopsis thaliana, a chromosome locus carrying a genetic determinant involved in the resistance to Ralstonia solanacearum. Thus lines of Arabidopsis thaliana, comprising the F₉ generation, obtained by crossing lines of Arabidopsis thaliana respectively sensitive to and resistant to Ralstonia solanacearum, were used to study the co-segregation of markers of the RLFP type with the expression of the resistance to the phenotype of this pathogen responsible for wilting.

[0012] These authors also showed that a genetic determinant of the resistance to Ralstonia solanacearum was localized on chromosome V of Arabidopsis thaliana, between the RFLP markers mi83 and mi61, in other words in a region of about 6.8 cM (approximately 1.7 to 3.4 Mb) of this chromosome.

[0013] According to these authors, the results of the study suggest that the locus of interest localized on chromosome V carries one or more genes, these genes being able either to provide the resistance phenotype, or on the other hand the phenotype of sensitivity to Ralstonia solanacearum, although the genetic basis of the resistance and/or the sensitivity to this pathogen were not identified in Arabidopsis thaliana.

[0014] In addition, DESLANDES et al. (1998) noted that the chromosome region of interest in Arabidopsis thaliana was known to contain several other genetic determinants involved in the recognition of pathogens.

[0015] The applicant has now identified a genetic determinant which is able to provide resistance to Ralstonia solanacearum in Arabidopsis thaliana.

[0016] This is a single gene coding for a polypeptide belonging to a new structural class of proteins. This new class of proteins identified according to the invention is characterized in that it has an N-terminal portion with structural units in common with several known resistance genes and a C-terminal portion having structural units in common with transcription factors containing a DNA-binding region, particularly those designated by the name of WRKY proteins or WRKY transcription factors.

[0017] It has also been shown according to the invention that the transformation of an Arabidopsis thaliana plant sensitive to Ralstonia solanacearum with a nucleic acid coding for a protein such as that defined above was able to provide resistance to different strains of this pathogen to the transformed plant.

[0018] The object of the invention is thus a nucleic acid containing at least 15 consecutive nucleotides of a nucleotide sequence coding for a protein of resistance of a plant to a pathogen, said protein comprising:

[0019] a) an N-terminal portion containing at least one amino acid sequence rich in leucine and at least one nucleotide-binding site; and

[0020] b) a C-terminal portion containing a DNA-binding domain, said binding domain comprising the amino acid sequence “WRKYGQK”.

[0021] The N-terminal portion of this protein preferably also contains at least one domain TIR(TOLL/IL-1R), such a TIR domain being principally defined by its sequence homology with the cytoplasmic domains of the TOLL protein of Drosophila and with the mammalian interleukin-1 receptor protein, whose consensus sequence is described by HAMMOND-KOSACK et al. (1997).

[0022] The N-terminal portion of this protein preferably also contains a loop P(P-LOOP), which is a peptide segment forming a loop and able to fix a phosphate, which is found in protein kinases.

[0023] Such a TIR domain preferably has the amino acid sequence “DEEFVCISCVEEVRYSFVSHLSEALRRKGINNVVVDVDIDDLLFKESQ AKIEKAGVSVMVLPGNCDPSEVWLDKFAKVLECQRNNKIDQAVVSVL YGDSLLRDQWLSELDFRGLSRIHQSRKECSDSILVEEIVRDVYET”

[0024] Such a P loop preferably has the amino acid sequence “CVGIWGMPGIGKTTLAKAV”.

[0025] The N-terminal portion preferably also contains a domain of the NB-ARC type, which is a conserved domain and characteristic of pathogen resistance proteins and of proteins involved in apoptosis mechanisms.

[0026] This NB-ARC domain preferably has the amino acid sequence represented in bold and underlined in FIGS. 1 and 2.

[0027] In addition, the N-terminal portion preferably contains a NLS nuclear signal site.

[0028] Such a NLS domain preferably has the amino acid sequence “KKKLSEMETAFLKLKRRPP”.

[0029] As already stated, the C-terminal portion of a protein coded by a nucleic acid according to the invention contains a WRKY domain characteristic of some protein transcription factors and characterized in that it contains the amino acid sequence “WRKYGQK”.

[0030] This WRKY domain preferably contains the following amino acid sequence: “DXXXWRKYGQKXIXGXXXPRXYYXCXXXXXXXCXXXKXXXXXEXXXXXXXXXYXSXHXH”,

[0031] in which X represents any of the 20 natural amino acids.

[0032] In addition, the C-terminal portion preferably contains a domain rich in leucine.

[0033] The C-terminal portion also preferably contains a characteristic domain of a nuclear localization signal site (NLS), said domain preferably having the amino acid sequence “NFHCWAPGKVVPKVRKD”.

[0034] The C-terminal portion also preferably contains a unit of the “leucine zipper” type, characteristic of a protein binding site, which preferably has the amino acid sequence “LRVSYDDLQEMDKVLFLYIASL”.

[0035] It is highly preferable that a nucleic acid according to the invention codes for a polypeptide containing, from its N-terminal end to its C-terminal end, (i) a TIR domain,

[0036] (ii) a P-LOOP unit,

[0037] (iii) an NB-ARC unit,

[0038] (iv) an NLS domain,

[0039] (v) a first region rich in leucine,

[0040] (vi) a second region rich in leucine,

[0041] (vii) an NLS domain,

[0042] (viii) a unit of the leucine zipper type,

[0043] (ix) a WRKY domain,

[0044] the domains (i) to (ix) being such as defined above.

[0045] A nucleic acid with complementary sequence is also in the scope of the invention.

[0046] A nucleic acid according to the invention is preferably in an isolated and/or purified form.

[0047] The term “isolated” in the context of the present invention means a biological material (nucleic acid or protein) which has been extracted from its original environment (the environment in which it is found naturally).

[0048] For example, a polynucleotide present in the natural state in a plant or an animal is not isolated. The same polynucleotide separated from the adjacent nucleic acids within which it is naturally inserted in the genome of the plant or animal is considered as “isolated”.

[0049] Such a polynucleotide may be included in a vector and/or such a polynucleotide may be included in a composition and nevertheless remain in the isolated state since the vector or the composition does not represent its natural environment.

[0050] The term “purified” does not require that the material is present in a form of absolute purity, excluding the presence of other compounds. It is more of a relative definition. A polynucleotide is in a “purified” state after purification of the starting material or the natural material by at least one order of magnitude, preferably 2 or 3 and more preferably 4 or 5 orders of magnitude.

[0051] For the purposes of the present invention, the expression “nucleotide sequence” may be used to mean either a polynucleotide or a nucleic acid. The expression “nucleotide sequence” includes the genetic material itself and is thus not limited to the information as to its sequence.

[0052] The terms “nucleic acid”, “polynucleotide”, “oligonucleotide” or “nucleotide sequence” include sequences of RNA, DNA, cDNA or RNA/DNA hybrid sequences of more than one nucleotide, either in the single or double stranded form.

[0053] The term “nucleotide” means both the natural nucleotides (A, T, G, C) and modified nucleotides which contain at least one modification such as (1), a purine analogue, (2) a pyrimidine analogue, or (3) an analogous sugar, examples of such modified nucleotides being disclosed for example in the application PCT N^(o) WO 95/04 064.

[0054] For the purposes of the present invention, a first polynucleotide is considered as being “complementary” to a second polynucleotide when each base of the first polynucleotide is paired with a complementary base of the second polynucleotide running in the opposite direction. The complementary bases are A and T (or A and U), or C and G.

[0055] Without wishing to be bound by any specific theory, the inventors consider that the various characteristic units of the protein class according to the invention are such as to provide its biological function which is manifested by the observation of a phenotype of resistance against plant pathogens, particularly against the bacterium R. solanacearum.

[0056] Thus, the N-terminal portion, which contains the TIR, NB-ARC domains and one or more leucine-rich regions are characteristic of many pathogen resistance proteins in plants (R proteins), these proteins being assumed to activate the cascade signal metabolic routes which co-ordinate the initial defence responses of the plant which prevent progress of the pathogens.

[0057] The WRKY domain of the C-terminal portion of a protein according to the invention may have a binding function to regulatory sequences near to or within promoter regions of plant genes, the fixation of a protein according to the invention to DNA being thus able to modulate the activity of other genes potentially involved in the resistance of plants to pathogens, and particularly to R. solanacearum.

[0058] The applicant has isolated and characterized a nucleic acid coding for a polypeptide belonging to the class of plant pathogen resistance proteins defined above, protein RRS1-R.

[0059] More precisely, the applicant has isolated, from the genome of an Arabidopsis thaliana plant resistant to R. solanacearum, the RRS1-R gene which is able to provide to the plant a phenotype of resistance to R. solanacearum, such a nucleic acid comprising the sequence SEQ ID N^(o) 1.

[0060] The invention relates to a nucleic acid containing at least 15 consecutive nucleotides of a nucleotide sequence having at least 40% identity in nucleotides with a polynucleotide of sequence SEQ ID N^(o) 1, as well as to a nucleic acid of complementary sequence.

[0061] According to the invention, a first nucleic acid having at least 40% identity with a second reference nucleic acid, has at least 60%, preferably at least 80%, 85%, 90%, 95%, 98%, 99% or 99.5% identity in nucleotides with this second reference polynucleotide, the percentage identity between two sequences being determined as described below.

[0062] The “percentage identity” between two sequences of nucleotides or amino acids, in the context of the present invention, may be determined by comparing two optimally aligned sequences through a comparison window.

[0063] The portion of the nucleotide or polypeptide sequence in the comparison window may thus contain additions or deletions (for example “gaps”) compared to the reference sequence (which does not contain these additions or deletions) so as to obtain an optimal alignment of the two sequences.

[0064] The percentage is calculated by determining the number of positions in which an identical nucleic base or amino acid residue is observed for the two sequences (nucleic or peptide) compared, then dividing the number of positions in which there is identity between the two bases or amino acid residues by the total number of positions in the comparison window, then multiplying by one hundred to obtain the percentage identity of the sequence.

[0065] The optimal alignment of the sequences for the comparison is performed by computer using known algorithms contained in the software tool of the company WISCONSIN GENETICS SOFTWARE PACKAGE, GENETICS COMPUTER GROUP (GCG), 575 Science Doctor, Madison, Wis.

[0066] The percentage identity between two sequences is preferably performed using the BLAST software (version BLAST 2.06 of September 1998), using exclusively the default parameters (S. F. ALTSCHUL et al., (1990); S F ALTSCHUL et al., 1997).

[0067] The genomic sequence of the RRS1-R gene is referenced as the sequence SEQ ID N^(o) 1.

[0068] A further object of the invention is a nucleic acid which contains or comprises the sequence SEQ ID N^(o) 1.

[0069] The invention also relates to a nucleic acid containing at least 15 consecutive nucleotides of the polynucleotide of sequence SEQ ID N^(o) 1.

[0070] Any of the genomic nucleic acids according to the present invention may be readily obtained by a person skilled in the art who knows its nucleotide sequence disclosed in the present specification.

[0071] A man skilled in the art may also reproduce any of the genomic nucleic acids according to the invention by constructing, based on the sequences disclosed in the present specification, oligonucleotide primers able to amplify all or part of these nucleic acids from a plant genome, preferably an Arabidopsis thaliana genome or from a bank of vectors (YACS, BACS, cosrmids) containing genomic inserts of a plant, and preferably of Arabidopsis thaliana.

[0072] After amplification using appropriate primers, the various amplified nucleic acids are then subjected to a step of ligation in a plasmid vector so as to obtain the desired genomic nucleic acid, according to techniques well known to a person skilled in the art.

[0073] Moreover, the isolation of the DNA fragment containing the desired genomic insert may be obtained by sub-cloning from a bank (YACS, BACS, cosmids) containing genomic inserts of a plant, and preferably of Arabidopsis thaliana.

[0074] The sequence of the RRS1-R gene contains seven exons and six introns, as well as potential regulatory regions located respectively at the 5′ side of the first exon and on the 3′ side of the last exon, the structural characteristics of these exons and introns being detailed in tables 1 and 2 below. TABLE 1 Sequence of the exons of the RRSI-R gene POSITION OF THE 5′ EXON NUCLEOTIDE IN SEQ ID POSITION OF THE 3′ N° N° 1 NUCLEOTIDE IN SEQ ID N° 1 1  260  636 2  746 1856 3 1937 2236 4 2326 3249 5 3438 4291 6 5377 5499 7 6085 6532

[0075] The invention also relates to a nucleic acid containing at least 15 consecutive nucleotides of an exonic polynucleotide of the RRS1-R gene, such as the polynucleotides 1 to 7 described in table 1 above, which are all included in the nucleic acid of sequence SEQ ID N^(o) 1.

[0076] In general, a nucleic acid having at least 15 consecutive nucleotides of a sequence according to the invention advantageously has at least 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, 500, 1 000, 2 000, 3 000, 4 000 consecutive nucleotides of the reference sequence, the length of consecutive nucleotides being naturally limited by the length of the reference sequence.

[0077] Such a nucleic acid codes for at least a part of the RRS1-R polypeptide and may in particular be inserted in a recombinant vector intended for the expression of the corresponding translation product in a host cell or in a plant transformed with this recombinant vector.

[0078] Such a nucleic acid may also be used for the synthesis of nucleotide probes and primers intended for the detection or amplification of nucleotide sequences contained in the RRS1-R gene in a sample, or optionally of sequences of the RRS1-R genes carrying one or more mutations, preferably one or more mutations which would modify the phenotype of a plant carrying such a mutant RRS1-R gene, for example by modifying the regulation of mechanisms of resistance to pathogens, and preferably mechanisms or resistance to R. solanacearum. TABLE 2 Regulatory and intron sequences of the RRS-IR gene Position of the 5′ nucleotide Position of the 3′ nucleotide in Intron n° in SEQ ID N° 1 SEQ ID N° 1 1  637  745 2 1857 1936 3 2237 2325 4 3250 3437 5 4292 5376 6 5500 6084

[0079] The invention also relates to a nucleic acid containing at least 15 consecutive nucleotides of an intronic polynucleotide of the RRS1-R gene, such as polynucleotides 1 to 6 described in table 2 above, which are all included in the nucleic acid of sequence SEQ ID N^(o) 1.

[0080] Such a nucleic acid may be used as an oligonucleotide probe or primer to detect the presence of at least one copy of the RRS1-R gene in a sample, or also to amplify a given target sequence within the RRS1-R gene.

[0081] In addition, the nucleic acid of the RRS1-R gene of sequence SEQ ID N^(o) 1 contains an ATG codon beginning at the nucleotide in position 260 (1st Exon) and a polyadenylation signal between the nucleotides in positions 6642 and 6647 inclusive.

[0082] The nucleic acid of the RRS1-R gene of sequence SEQ ID N^(o) 1 contains regulatory sequences located respectively on the 5′ side of exon 1 and the 3′ side of exon 7.

[0083] The 5′ regulatory region of the RRS1-R gene comprises the nucleotide sequence SEQ ID N^(o) 2.

[0084] The polynucleotide of sequence SEQ ID N^(o) 2 is located between the nucleotide in position 1 and the nucleotide in position 259 of sequence SEQ ID N^(o) 1.

[0085] The 3′ regulatory sequence of the RRS1-R gene comprises the nucleotide sequence SEQ ID N^(o) 3.

[0086] The polynucleotide of sequence SEQ ID N^(o) 3 is located between the nucleotide in position 6533 and the nucleotide in position 7936 of the nucleotide sequence SEQ ID N^(o) 1.

[0087] The polynucleotide derived from the regulatory regions of the RRS1-R gene above is useful for detecting the presence of at least one copy of this gene in a sample.

[0088] The invention also relates to a nucleic acid containing at least 15 consecutive nucleotides of a polynucleotide of sequence SEQ ID N^(o) 2 or SEQ ID N^(o) 3.

[0089] According to another embodiment, the invention also relates to a nucleic acid comprising a polynucleotide having at least 80% identity in nucleotides with a polynucleotide of sequence SEQ ID N^(o) 2 or SEQ ID N^(o) 3, advantageously at least 90%, 95%, 98%, 99%, 99.5% and more preferably 99.8% of identity in nucleotides with a polynucleotide selected from the sequences SEQ ID N^(o) 2 and SEQ ID N^(o) 3, or a biologically active fragment of the latter.

[0090] Preferred fragments of the nucleotide sequences SEQ ID N^(o) 2 or SEQ ID N^(o) 3 advantageously have a length of between 50, 100, 150, 200 and 300, 400, 600, 1 000 or 2 000 bases.

[0091] “Biologically active” fragment of a polynucleotide of sequence SEQ ID N^(o) 2 or SEQ ID N^(o) 3 according to the invention means a polynucleotide containing or consisting of a polynucleotide able to regulate the expression of a nucleic acid placed close to the latter, in a recombinant host cell.

[0092] For the purposes of the present invention, a nucleic acid constitutes a “functional” regulatory region to express a nucleic acid placed close to the latter, for example a nucleic acid coding for a polypeptide or a polynucleotide of interest, if this regulatory polynucleotide contains the nucleotide sequence containing the regulation signals of the transcriptions and/or the translation, and if the sequences are located in such a way as effectively to induce or increase the transcription or translation of said polypeptide or polynucleotide of interest.

[0093] In order to identify the “biologically active” fragments of the polynucleotides of sequence SEQ ID N^(o) 2 or SEQ ID N^(o) 3, a person skilled in the art may advantageously refer to the book by SAMBROOK et al. (1989) which describes the use of a recombinant vector carrying a marker gene (for example β-galactosidase, chloramphenicol acetyltransferase, etc.) whose expression is detected on placing the sequence of the marker gene under the control of the biologically active polynucleotide fragments of the sequences SEQ ID N^(o) 2 or SEQ ID N^(o) 3.

[0094] The genomic sequences located upstream of the first exon of the RRS1-R gene are cloned in an appropriate vector containing a marker gene, such as the GUS gene (Jefferson et al., 1987).

[0095] The polynucleotide fragments of sequence SEQ ID N^(o) 2 or SEQ ID N^(o) 3 according to the invention may be prepared from any of the sequences SEQ ID N^(o) 1, SEQ ID N^(o) 2 and SEQ ID N^(o) 3 by cleavage using appropriate restriction endonucleases, as described for example in the book by SAMBROOK et al. (1989) cited above.

[0096] The regulatory polynucleotide fragments according to the invention may also be prepared by digestion of any of the sequences SEQ ID N^(o) 1, SEQ ID N^(o) 2 and SEQ ID N^(o) 3 by an exonuclease enzyme, such as for example Bal3I as described by WABIKO et al. (1986), WABIKO H. et al., 1986, DNA, volume 5 (4): 305-314.

[0097] Such regulatory polynucleotides may also be prepared by chemical synthesis of the nucleic acids according to techniques well known to a person skilled in the art, such as the phosphoramidite techniques cited above.

[0098] The level of activity and tissue specificity of the regulatory sequences, in particular the promoter sequences of the RRS1-R gene according to the invention may be determined by measuring the level of expression of a detectable polynucleotide placed under the control of the latter in different types of plant cells and tissues. The detectable polynucleotide may be either a polynucleotide which is specifically hybridized with an oligonucleotide probe of predetermined sequence, or a polynucleotide coding a detectable protein, including the polypeptide RRS1-R or a fragment of this.

[0099] A test allowing such a verification is well known to a person skilled in the art and is in particular disclosed in the U.S. Pat. No. 5,502,176 and U.S. Pat. No. 5,266,488, incorporated herein by reference.

[0100] The invention thus also relates to a nucleic acid containing:

[0101] a) a nucleic acid comprising a regulatory nucleotide sequence of at least 50 consecutive nucleotides of the sequence SEQ ID N^(o) 3 or a sequence having at least 80% identity in nucleotides with sequence SEQ ID N^(o) 2;

[0102] b) a polynucleotide coding a polypeptide of interest or a polynucleotide of interest whose transcription and/or translation is placed under the control of the regulatory nucleic acid a);

[0103] c) optionally, a regulatory nucleic acid comprising at least 50 consecutive nucleotides of the sequence SEQ ID N^(o) 3 or a sequence having at least 80% identity in nucleotides with sequence SEQ ID N^(o) 3.

[0104] The polypeptide of interest coded by the nucleic acid b) above may be of various types and origins; it may be a protein of eukaryotic or prokaryotic origin.

[0105] The polypeptides of interest include the toxic polypeptides, such as for example the elicitins, for plant cells, their expression being such as to induce the death of the cell in which they are expressed and as a result a confinement of the pathogens which have infected the plant, thus preventing the propagation of the pathogen to all the organs of the plant. For example, it is possible to generate transgenic tobaccos having an increased resistance to the pathogens expressing eliciting, small toxic peptides produced by a phytopathogenic fungus, according to the technique described by Keller et al. (1999).

[0106] The nucleic acid of interest mentioned in b) above, generally an RNA molecule, may be complementary to a target polynucleotide, for example a polynucleotide located in a region coding the RRS1-R gene, and may thus be advantageously used as an antisense polynucleotide.

[0107] It has been shown according to the invention that the RRS1-R gene is transcribed in the form of a messenger RNA which has been isolated and characterized. This messenger RNA contains a unique open reading frame coding for the RRS1-R protein which is 1378 amino acids long.

[0108] The messenger RNA of the RRS1-R gene and the corresponding cDNA may be readily obtained by a skilled person in the art, for example by screening a cDNA band using a suitable probe or by rapid amplification of the cDNA ends (PCR, RACE-PCR). A person skilled in the art could also obtain the cDNA of the RRS1-R gene by using the techniques described in example 6, or by direct chemical synthesis.

[0109] A skilled person in the art could also reproduce a nucleic acid according to the invention by direct chemical synthesis, such as the phosphodiester method described by NARANG et al. (1979), the phosphodiester method described by BROWN et al. (1979), the diethylphosphoramidite method described by BEAUCAGE et al. (1981), as well as the solid phase method described in the European patent application EP 0 707 592, the content of these documents being incorporated herein by reference. On either side of the open reading frame, this messenger RNA contains respectively an untranslated 5′ region (5′-UTR) and an untranslated 3′ region (3′-UTR).

[0110] The cDNA resulting from the transcription of the RRS1-R gene is referenced as the sequence SEQ ID N^(o) 4.

[0111] The 5′-UTR sequence of the messenger RNA transcribed by the RRS1-R gene is composed of the sequence between the nucleotide in position 1 and the nucleotide in position 81 of the sequence SEQ ID N^(o) 4.

[0112] The 3′-UTR sequence of the messenger RNA transcribed by the RRS1-R gene is composed of the sequence between the nucleotide in position 4219 and the last nucleotide at the 3′-end of the sequence SEQ ID N^(o) 4.

[0113] The invention also relates to a nucleic acid containing at least 15 consecutive nucleotides of a polynucleotide selected from the 5′-UTR and 3′-UTR sequences of the messenger RNA of the RRS1-R gene described above, a polynucleotide having at least 80% identity in nucleotides with such a nucleic acid and a nucleic acid with complementary sequence to said nucleic acids or said polynucleotides.

[0114] A nucleic acid which contains or consists of the nucleotide sequence SEQ ID N^(o) 4 is also in the scope of the invention.

[0115] The 5′-UTR and 3′-UTR regions of the messenger RNA of the RRS1-R gene may contain components for regulating the transcription and/or translation of this gene, such as one or more ribosome binding sites, one or more polyadenylation sites, sequences enhancing the stability of the messenger RNA or all or part of the promoter region of the transcription.

[0116] The applicant has also isolated and characterized, from an ecotype of Arabidopsis thaliana sensitive to the R. solanacearum pathogen, the gene corresponding to the RRS1-R gene, containing, compared to the sequence of the RRS1-R gene, several additions and substitutions of nucleotides, the mutant gene found in the sensitive ecotype of Arabidopsis thaliana being designated RRS1-S for the purposes of the present specification.

[0117] The RRS1-S gene contains a first mutation in the portion coding for the C-terminal region of the protein, leading to the replacement of the amino acid sequence “SEASKLERL” between the amino acid in position 704 and the amino acid in position 712 of the polypeptide coded by the RRS1-R gene of sequence SEQ ID N^(o) 1 by the amino acid sequence “SEELERL” between the amino acid in position 704 and the amino acid in position 710 of the polypeptide coded by the RRS1-S gene. This mutation thus leads, on the one hand, to the substitution of the alanine residue in position 706 of the protein coded by the RRS1-R gene by a glutamic acid residue in position 706 of the protein coded by the RRS1-S gene and, on the other hand, by the deletion of the amino acids serine and lysine in positions 707 and 708 respectively of the amino acid sequence of the protein coded by the RRS1-R gene of sequence SEQ ID N^(o) 1.

[0118] A second mutation consists of the insertion of an early stop codon found in the coding region of the RRS1-S gene, leading to a deletion of 90 amino acids located at the C-terminal end of the RRS1-R protein, the polypeptide coded by the RRS1-S gene being as a result of 1 288 amino acids.

[0119] Without wishing to be bound by any particular theory, the applicant considers that the different mutations carried by the RRS1-S gene identified in the Col-5 ecotype of Arabidopsis thaliana, which is sensitive to the R. solanacearum pathogen, are responsible for the phenotype of sensitivity to R. solanacearum observed in this ecotype.

[0120] The result is that the different modifications observed in the nucleotide sequence of the RRS1-S gene compared to the nucleotide sequence of the RRS1-R gene are such as significantly to alter the biological activity of the resulting polypeptide.

[0121] In consequence, the nucleotide sequence of the RRS1-S gene found in the Col-5 ecotype of Arabidopsis thaliana is especially useful for developing various specific means for detection of the RRS1-S gene, such means of detection allowing a skilled person in the art to determine if a plant of interest contains the RRS1-S gene in its genome and is as a result sensitive to the bacterial pathogen R. solanacearum.

[0122] The genomic sequence of the RRS1-S gene is referenced as the sequence SEQ ID N^(o) 5. A further object of the invention is a nucleic acid which comprises or contains the sequence SEQ ID N^(o) 5.

[0123] The invention also relates to a nucleic acid containing at least 15 consecutive nucleotides of a polynucleotide of sequence SEQ ID N^(o) 5.

[0124] The invention further relates to a nucleic acid having at least 40% of identity in nucleotides with the nucleotide sequence SEQ ID N^(o) 5 or with a polynucleotide containing at least 15 consecutive nucleotides of the nucleotide sequence SEQ ID N^(o) 5, as well as a nucleic acid with complementary sequence.

[0125] The sequence of the RRS1-S gene contains 7 exons and 6 introns, whose structural characteristics are detailed in tables 3 and 4 below respectively. TABLE 3 EXON SEQUENCES OF THE RRS1-S GENE Position of the 5′ nucleotide in Position of the 3′ nucleotide in Exon n° SEQ ID N° 5 SEQ ID N° 5 1 1184 1560 2 1670 2780 3 2861 3160 4 3254 4171 5 4360 5213 6 6302 6424 7 6953 7136

[0126] The invention also relates to a nucleic acid containing at least 15 consecutive nucleotides of an exonic polynucleotide of the RRS1-S gene, such as the polynucleotides 1 to 7 described in table 3 above, which are all included in the nucleic acid of sequence SEQ ID N^(o) 5.

[0127] Such a nucleic acid codes for at least a part of the RRS1-S polypeptide and may in particular be inserted into a recombinant vector intended for the expression of the corresponding translation product in a host cell or in a plant transformed with this recombinant vector.

[0128] Such a nucleic acid may also be used for the synthesis of nucleotide probes and primers intended for the detection or amplification of nucleotide sequences contained in the RRS1-S gene in a sample. TABLE 4 INTRON SEQUENCES OF THE RRS1-S GENE Position of the 5′ nucleotide in Position of the 3′ nucleotide in Intron n° SEQ ID N° 5 SEQ ID N° 5 1 1561 1669 2 2781 2860 3 3161 3253 4 4172 4359 5 5214 6301 6 6425 6952

[0129] The invention also relates to a nucleic acid containing at least 15 consecutive nucleotides of an intronic polynucleotide of the RRS1-S gene, such as the polynucleotides 1 to 7 described in table 4 above, which are all included in the nucleic acid of sequence SEQ ID N^(o) 5.

[0130] Such a nucleic acid may be used as an oligonucleotide probe or primer to detect the presence of at least one copy of the RRS1-S gene in a sample, or to amplify a given target sequence within the RRS1-S gene.

[0131] Such probes are preferably constructed so as to hybridize specifically with regions of the RRS1-S gene which contain one or more substitutions, additions or deletions of bases compared to the nucleotide sequence of the RRS1-R gene of sequence SEQ ID N^(o) 1.

[0132] Such primers preferably allow the amplification of regions of the RRS1-S gene which contain one or more substitutions, additions or deletions of bases compared to the nucleotide sequence of the RRS1-R gene of sequence SEQ ID N^(o) 1.

[0133] The applicant has also isolated and characterized the untranscribed nucleotide sequences located respectively on the 5′ side and the 3′ side of the coding regions of the RRS1-S gene.

[0134] These untranscribed 5′ and 3′ sequences are able to carry the signals for regulating the transcription and/or translation of the RRS1-S gene and are also in the scope of the invention, and are respectively referenced as sequences SEQ ID N^(o) 6 and SEQ ID N^(o) 7.

[0135] The invention thus also relates to a nucleic acid containing at least 50 consecutive nucleotides of a regulatory polynucleotide of the RRS1-S gene selected from the nucleotide sequences SEQ ID N^(o) 6 and SEQ ID N^(o) 7 and the sequences having at least 80% identity in nucleotides with one of the sequences SEQ ID N^(o) 6 and SEQ ID N^(o) 7, as well as a nucleic acid with complementary sequence to said nucleic acids and polynucleotides.

[0136] The invention also relates to a nucleic acid containing:

[0137] a) a 5′ regulatory polynucleotide of the RRS1-S gene such as defined above;

[0138] b) a polynucleotide coding for a polypeptide or a polynucleotide of interest;

[0139] c) optionally, a 3′ regulatory polynucleotide of the RRS1-S gene such as defined above,

[0140] the nucleic acid coding for the polypeptide or the polynucleotide of interest being placed under the control of the 5′ regulatory polynucleotide of the RRS1-S gene and, optionally, also under the control of the 3′ regulatory polynucleotide of the RRS1-S gene.

[0141] The applicant has shown that the RRS1-S gene is transcribed in the form of a messenger RNA which has been isolated and characterized. This messenger RNA contains a unique open reading frame coding for the RRS1-S protein which is 1288 amino acids in length.

[0142] On either side of the open reading frame, this messenger RNA contains respectively a 5′ untranslated region (5′-UTR) and a 3′ untranslated region (3′UTR).

[0143] The cDNA of the RRS1-S gene is referenced as sequence SEQ ID N^(o) 8.

[0144] The 5′-UTR sequence of the messenger RNA transcribed by the RRS1-S gene is composed of the sequence between the nucleotide in position 1 and the nucleotide in position 81 of the sequence SEQ ID N^(o) 8.

[0145] The 3′-UTR sequence of the messenger RNA transcribed by the RRS1-S gene is composed of the sequence between the nucleotide in position 3949 and the last nucleotide at the 3′-end of the sequence SEQ ID N^(o) 8.

[0146] The invention also relates to a nucleic acid containing at least 15 consecutive nucleotides of a polynucleotide of sequence SEQ ID N^(o) 8 or 15 consecutive nucleotides of a sequence having at least 40% identity in nucleotides with sequence SEQ ID N^(o) 8, as well as a complementary sequence of said nucleic acid.

[0147] The nucleic acids according to the invention, and in particular the nucleotide sequences SEQ ID N^(o) 1 to SEQ ID N^(o) 8, their fragments of at least 15 nucleotides, the sequences having at least 40% identity in nucleotides with at least a part of the sequences SEQ ID N^(o) 1 to SEQ ID N^(o) 8, as well as the nucleic acids with complementary sequences, are useful for the detection of the presence of at least one copy of a nucleotide sequence of the RRS1-R or RRS1-S gene or a fragment or an allelic variant of these genes in a sample.

[0148] The nucleotide probes and primers which hybridize, under very strict hybridization conditions, with a nucleic acid selected from the sequences SEQ ID N^(o) 1 to SEQ ID N^(o) 8, are also in the scope of the invention.

[0149] The following hybridization conditions constitute very strict hybridization conditions in the context of the invention:

[0150] the DNA to be tested is immobilized on membranes of the Hybond-N⁺ type (Amersham, Buckinghamshire, UK) in accordance with the manufacturer's instructions, in the presence of 0.4 M NaOH overnight;

[0151] the membranes are washed with a buffer 5×SSC (1×SSC corresponds to 0.15 M NaCl+0.015 M sodium citrate), then the membranes are treated with UV at 312 nm for 3 minutes, then pre-hybridized for at least 4 hours at 65° C. in a hybridization buffer (6×SSC, 5× Denhardt's, 100 μg of calf thymus single-stranded DNA/ml and 0.5% SDS [weight/volume]).

[0152] The probes are added to the membranes and incubated at 65° C. overnight.

[0153] After the hybridization step, the membranes are washed in 500 ml of a buffer 2×SSC, 1% SDS (weight/volume) at laboratory temperature for 30 minutes;

[0154] a second washing is performed in 500 ml of a buffer 0.1×SSC, 0.1% SDS (weight/volume) for 15 minutes at 42° C.

[0155] The hybridization conditions described above are suitable for hybridization under very strict conditions of a nucleic acid molecule of 300 to 400 nucleotides in length.

[0156] Needless to say, the hybridization conditions described above may be adapted as a function of the length of the nucleic acid whose hybridization is sought or the type of marking selected, according to techniques known to a person skilled in the art.

[0157] Appropriate hybridization conditions may for example be adapted according to the teaching contained in the book by HAMES and HIGGINS (1985) or in that by AUSUBEL et al. (1989).

[0158] The nucleotide probes or primers according to the invention contain at least 15 consecutive nucleotides of a nucleic acid according to the invention, in particular of a nucleic acid of sequence SEQ ID N^(o) 1 to SEQ ID N^(o) 8 or its complementary sequence, of a nucleic acid having at least 40% identity in nucleotides with a sequence selected from the sequences SEQ ID N^(o) 1 to SEQ ID N^(o) 8 or its complementary sequence, or of a nucleic acid hybridizing, under very strict hybridization conditions, with a sequence selected from the sequences SEQ ID N^(o) 1 to SEQ ID N^(o) 8 or its complementary sequence.

[0159] The nucleotide probes or primers according to the invention preferably have a length of at least 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400 or 500 consecutive nucleotides of a nucleic acid according to the invention, in particular of a nucleic acid with nucleotide sequence selected from the sequences SEQ ID N^(o) 1 to SEQ ID N^(o) 8, or of a nucleic acid of complementary sequence.

[0160] According to another embodiment a nucleotide probe or primer according to the invention contains or comprises fragments of a length of 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400 or 500 consecutive nucleotides of a nucleic acid according to the invention, more particularly of a nucleic acid selected from the sequences SEQ ID N^(o) 1 to SEQ ID N^(o) 8, or of a nucleic acid of complementary sequence.

[0161] Examples or primers and primer pairs allowing amplification of different regions of the RRS1-R gene are for example the sequences SEQ ID N^(o) 11 to SEQ ID N^(o) 61 represented in table 6.

[0162] A nucleotide probe or primer according to the invention may be prepared by any suitable method known to a person skilled in the art, including cloning and action of restriction enzymes or by direct chemical synthesis according to techniques such as the phosphodiester method of Narang et al. (1979) or of Brown et al. (1979) cited above.

[0163] Any of the nucleic acids according to the invention, including the oligonucleotide probes and primers described above, may be marked, if desired, by incorporating a marker detectable by spectroscopic, photochemical, biochemical, immunochemical or chemical means.

[0164] For example, such markers may consist of radioactive isotopes (³²P,³H,¹³⁵S), fluorescent molecules (5-bromodeoxyuridine, fluorescein, acetylaminofluorene, digoxigenin) or ligands such as biotin.

[0165] The marking of a nucleic acid is preferably performed by incorporation of the marked molecules within nucleotides by extension of the primers, or by addition to the 5′ or 3′ ends.

[0166] Examples of non-radioactive marking of nucleic acid fragments are described in particular in the patent FR 78 19 175 or in the articles by URDEA et al. (1988), or SANCHEZ-PESCADOR et al. (1988).

[0167] The probes according to the invention may advantageously have structural properties enabling an amplification of the signal, such as the probes described by URDEA et al. (1991) or in the European patent n^(o) EP 0 225 807 (Chiron).

[0168] The oligonucleotide probes according to the invention may in particular be used in hybridizations of the Southern type to genomic DNA, or in hybridizations to the messenger RNA of the RRS1-R and RRS1-S genes, when the expression of the corresponding transcript is sought in a sample.

[0169] The probes according to the invention may also be used for the detection of PCR amplification products or also for the detection of mismatches.

[0170] Nucleotide probes or primers according to the invention may be immobilized on a solid phase. Such solid phases are well known to a person skilled in the art and include the surfaces of microtitration plate wells, polystyrene beads, magnetic beads, nitro-cellulose tapes or microparticles such as latex particles.

[0171] The present invention also relates to a method for detecting the presence of a nucleic acid of the RRS1-R or RRS1-S gene in a sample, said method comprising the steps of:

[0172] placing a nucleotide probe or several probes according to the invention with the sample to be tested;

[0173] detecting any complex which may be formed between the probes and the nucleic acid present in the sample.

[0174] According to a specific embodiment of the detection method according to the invention, the nucleotide probe or probes are immobilized on a solid phase.

[0175] According to a further embodiment, the oligonucleotide probes contain a detectable marker.

[0176] The invention also relates to a pack or kit for detecting the presence of a nucleic acid of the RRS1-R or RRS1-S gene in a sample, said kit comprising:

[0177] a) one or more nucleotide probes, such as described above;

[0178] b) optionally, the reagents necessary for the hybridization reaction.

[0179] According to a first embodiment, the detection pack or kit is characterized in that the probe or probes are immobilized on a solid phase.

[0180] According to a second embodiment, the detection pack or kit is characterized in that the oligonucleotide probes contain a detectable marker.

[0181] According to a particular embodiment of the detection kit described above, such a kit contains several oligonucleotide probes according to the invention which may be used to detect target sequences of interest of the RRS1-R or RRS1-S gene or to detect mutations of the coding or non-coding regions of the RRS1-R gene, more particularly the nucleic acids of the sequences SEQ ID N^(o) 1 to SEQ ID N^(o) 8 or the nucleic acids of complementary sequence.

[0182] The nucleotide primers according to the invention may be used to amplify any nucleotide fragment (gDNA, cDNA, mRNA) of RRS1-R or RRS1-S, and more particularly all or part of a nucleic acid of sequence SEQ ID N^(o) 1 to SEQ ID N^(o) 8.

[0183] Another object of the invention relates to a method for amplifying a nucleic acid of the RRS1-R or RRS1-S gene, and more particularly a nucleic acid of sequence SEQ ID N^(o) 1 to SEQ ID N^(o) 8 or a fragment or a nucleic acid of complementary sequence to the latter, contained in a sample, said method comprising the steps of:

[0184] a) placing the sample in which the presence of the target nucleic acid is suspected in contact with a pair of nucleotide primers whose position of hybridization is located respectively on the 5′ side and on the 3′ side of the region of the target nucleic acid of RRS1-R or RRS1-S whose amplification is sought, in the presence of the reagents necessary for the amplification reaction and;

[0185] b) detecting any nucleic acid amplified.

[0186] According to the amplification method above, at least one amplification cycle of the nucleic acid contained in the sample is performed before the detection of any nucleic acid amplified, preferably at least 10, and more preferably at least 20, cycles of amplification.

[0187] It is advantageous to use any one of the nucleotide primers described above to perform the above amplification method.

[0188] A further object of the invention is a pack or kit for amplifying a nucleic acid of the RRS1-R or RRS1-S gene according to the invention, and more particularly all or part of a nucleic acid of sequence SEQ ID N^(o) 1 to SEQ ID N^(o) 8, said pack or kit comprising:

[0189] a) a pair of nucleotide primers according to the invention, whose hybridization position is located respectively at the 5′ side and the 3′ side of the target nucleic acid of the RRS1-R or RRS1-S gene whose amplification is sought;

[0190] b) optionally, the reagents necessary for the amplification reaction.

[0191] Such an amplification pack or kit advantageously contains at least one pair of nucleotide primers such as described above.

[0192] According to a preferred embodiment, primers according to the invention contain all or part of a polynucleotide selected from the nucleotide sequences SEQ ID N^(o) 11 to SEQ ID N^(o) 61.

[0193] The applicant has shown that plants carrying the RRS1-R gene have a phenotype of resistance to R. solanacearum. In addition, the applicant has also shown that the insertion of the RRS1-R gene or of the cDNA of the RRS1-R gene into the genome of a plant initially sensitive to R. solanacearum provides to this plant a phenotype of resistance to this pathogen.

[0194] The invention also relates to methods and means designed to inhibit or block the expression of the RRS1-S gene found in plants sensitive to R. solanacearum, particularly with a view to increasing their resistance to different pathogens, by any technique known to a person skilled in the art.

[0195] In order to inhibit or block the expression of the RRS1-S gene in a plant, a skilled person in the art may use antisense polynucleotides.

[0196] Thus, the invention also relates to an antisense polynucleotide, able to hybridize specifically with a given region of the RRS1-S gene and able to inhibit or to block its transcription and/or translation. Such a polynucleotide has the general structure which has been defined above for the probes and primers according to the invention.

[0197] An antisense polynucleotide according to the invention preferably hybridizes with a sequence corresponding to a sequence located in the region of the 5′ end of the RRS1-S messenger RNA, and more preferably close to the codon for initiating the translation (ATG) of the RRS1-S gene.

[0198] According to a second preferred embodiment, an antisense polynucleotide according to the invention contains a sequence corresponding to one of the sequences located at the exon/intron junctions of the RRS1-S gene and preferably sequences corresponding to a splicing site, which may be determined according to techniques well known to a person skilled in the art, based on the description of the sequences of the RRS1-S gene of the present specification.

[0199] In order to synthesize the antisense polynucleotides such as defined above, a person skilled in the art may refer to tables 3 and 4 in which the positions of the different exons and introns of the RRS1-S gene in the sequence SEQ ID N^(o) 5 are listed.

[0200] The antisense polynucleotides must in general have a length and a melting point sufficient to form an intracellular duplex hybrid with a sufficient stability to inhibit the expression of the RRS1-S mRNA.

[0201] The strategies which may be used by a person skilled in the art to construct the antisense polynucleotides are particularly described by GREEN et al. (1986) and by IZANT and WEINTRAIB (1984), the content of these two articles being thus incorporated herein by reference.

[0202] Methods for construction of antisense polynucleotides which may be used by a person skilled in the art are also described by ROSSI et al. (1991) and in the PCT application n^(o) WO 94/23 026, WO 95/04141, WO 92/18522 and in the European patent application n^(o) EP 0 572 287, the content of these documents being incorporated by reference.

[0203] An antisense polynucleotide according to the invention advantageously has a length of 15 to 4 000 nucleotides. An antisense polynucleotide of the invention preferably has a length of between 15, 20, 25, 30, 35, 40, 45 or 50 to 75, 100, 200, 500, 1 000, 2 000, 3 000 or 4 000 nucleotides.

[0204] The preferred antisense polynucleotides according to the invention 10 are those having respectively a length of about 300 nucleotides or a length of about 4 000 nucleotides.

[0205] In order to inhibit or block the expression of the RRS1-S gene, it is possible simultaneously to use several antisense polynucleotides such as defined above, each of the antisense polynucleotides hybridizing with a specific region of the RRS1-S gene or of its messenger RNA.

[0206] The antisense polynucleotides according to the invention are more preferably defined in such a way that they hybridize with a region of the RRS1-S gene which has, in comparison with the corresponding sequence of the RRS1-R gene, one or more substitutions, deletions, additions of at least one base.

[0207] Other methods of using the antisense polynucleotides are for example those described by SCZAKIEL et al. (1995).

[0208] Another strategy for inhibiting or blocking the expression of the RRS1-S gene consists of using polynucleotides able to form a triple DNA helix with the genomic region of double-stranded DNA carrying the RRS1-S gene.

[0209] In general, homopurine sequences are considered as the most useful in this type of strategy, although homopyrimidine sequences may also be used.

[0210] A person skilled in the art may advantageously refer to the genomic sequence SEQ ID N^(o) 5 of RRS1-S in order to select in this sequence sequences of homopurine or of homopyrimidine of 10 to 20 nucleotides long, which are able to inhibit the expression of the RRS1-S gene.

[0211] Methods of inhibiting the expression of a gene by the triple helix technique are for example those described by GRIFFIN et al. (1989).

[0212] A further object of the invention is thus the use of an antisense polynucleotide or of a homopyrimidine polynucleotide, such as defined above, or of a recombinant vector containing such a polynucleotide, to inhibit or block the expression of the RRS1-S gene in a plant cell or in a whole plant.

[0213] As already noted, complementation experiments with Arabidopsis thaliana plants belonging to an ecotype sensitive to R. solanacearum by a recombinant vector containing the genomic sequence or a cDNA sequence of the RRS1-R gene revealed, in the transformed plant, the appearance of a phenotype of resistance to R. solanacearum.

[0214] In order to stimulate the resistance of a plant to different pathogens, and more particularly to R. solanacearum, it would thus be advantageous either to introduce or to increase the number of copies of the RRS1-R gene in this plant, or to encourage the expression of the RRS1-R gene, these two methods being likely to prevent the development of such a pathogen.

[0215] A strong expression of the RRS1-R gene in a plant may be achieved either by overexpression of the RRS1-R gene, or by insertion of multiple copies of a polynucleotide coding for the RRS1-R protein into the plant, or by a combination of these two strategies. For the insertion of multiple copies of a polynucleotide coding for the RRS1-R protein into the genome of a plant, the use of a recombinant vector according to the invention is advantageous.

[0216] The invention also relates to a recombinant vector containing a nucleic acid according to the invention.

[0217] Such a recombinant vector advantageously contains a nucleic acid selected from among the following nucleic acids:

[0218] a nucleic acid containing at least 15 consecutive nucleotides of a nucleotide sequence coding for a protein for resistance of a plant to a pathogen, said protein comprising:

[0219] (i) an N-terminal portion containing at least one sequence of amino acids rich in leucine and at least one nucleotide binding site; and

[0220] (ii) a C-terminal portion containing a DNA binding domain comprising the amino acid sequence “WRKYGQK”, as well as a nucleic acid of complementary sequence;

[0221] b) a nucleic acid containing a sequence having at least 40% identity in nucleotides with the nucleotide sequence SEQ ID N^(o) 1, as well as a nucleic acid of complementary sequence;

[0222] c) a nucleic acid containing a sequence having at least 40% identity in nucleotides with the nucleotide sequence SEQ ID N^(o) 5;

[0223] d) a nucleic acid containing at least 15 consecutive nucleotides of a polynucleotide consisting of one of the exons 1 to 7 of the RRS1-R gene or of the RRS1-S gene, such as defined above;

[0224] e) a nucleic acid comprising a polynucleotide of at least 15 consecutive nucleotides of a polynucleotide consisting of one of the introns 1 to 7 of the RRS1-R or RRS1-S gene, such as defined above;

[0225] f) a nucleic acid having at least 15 consecutive nucleotides of a regulatory polynucleotide of the RRS1-R or RRS1-S gene, such as defined above;

[0226] g) an antisense polynucleotide or a homopurine or homopyrimidine polynucleotide, such as defined above, useful for inhibiting the expression of the RRS1-S gene;

[0227] “Vector”, in the sense of the present invention, means a circular or linear molecule of DNA or RNA which may be either in single or double-stranded form.

[0228] A recombinant vector according to the invention may be either a cloning vector, an expression vector, or more specifically an insertion vector, a transformation vector or an integration vector.

[0229] It may be a vector of bacterial or viral origin.

[0230] According to a first embodiment, a recombinant vector according to the invention is used with the aim of amplifying the nucleic acid which is inserted after transformation or transfection of the desired host cell.

[0231] According to a second embodiment, the vector is an expression vector containing, in addition to a nucleic acid coding for a polypeptide according to the invention, in particular polypeptides coded by the RRS1-R and RRS1-S genes, regulatory sequences allowing directing the transcription and/or the translation.

[0232] According to an advantageous embodiment, a recombinant vector according to the invention particularly contains the following components:

[0233] 1) components regulating the expression of the nucleic acid of the RRS1-R gene or of the RRS1-S gene to be inserted, such as promoters and enhancer sequences;

[0234] 2) the sequence contained in the nucleic acid of the RRS1-R or RRS1-S gene according to the invention to be inserted in such a vector, said sequence being placed under the control of the regulation signals described in (1); and

[0235] 3) initiation and stop sequences of the appropriate transcription.

[0236] In addition, the recombinant vectors according to the invention may include one or more replication origins in the host cells in which their amplification or their expression is sought as well as selection markers.

[0237] In a particular embodiment, a recombinant vector according to the invention contains an antisense polynucleotide or a homopurine or homopyridine polynucleotide, such as defined above, optionally placed under the control of suitable regulatory sequences ensuring the expression in a selected host cell or plant. Such a recombinant vector is preferably used to inhibit the expression of the RRS1-S gene in the cell or in the plant.

[0238] According to another particular embodiment, a recombinant vector according to the invention contains a polynucleotide coding for the RRS1-R polypeptide or a polypeptide having at least 40% identity in amino acids with the latter and retaining the biological activity of RRS1-R, placed under the control of regulatory sequence(s) enabling strong expression of RRS1-R or of its homologue in a selected host cell or plant. Such a recombinant vector is useful to enable a high level of expression of RRS1-R in a plant.

[0239] According to an advantageous embodiment, such a recombinant vector is an integrative vector enabling the insertion of multiple copies of the coding sequence of RRS1-R in the genome of a plant.

[0240] As an example, the bacterial promoters may be the promoters LacI, LacZ, promoters of the RNA polymerase of the bacteriophage T3 or T7, the promoters PR, or PL of the phage lambda.

[0241] Promoters for the expression of a nucleic acid of RRS1-R or RRS1-S according to the invention in plants are the promoter CaMV 35 S of the cauliflower mosaic virus of Odell et al. (1985), or the promoter of the gene of lactin 1 of rice, McElroy et al. (1990).

[0242] Other promoters useful to a person skilled in the art for the expression of a polynucleotide of interest in plants are described in the U.S. Pat. Nos. 5,750,866 and 5,633,363, incorporated herein by reference.

[0243] In general, for the choice of a suitable promoter, a person skilled in the art may advantageously refer to the book by Sambrook et al. (1989) cited above or the techniques described by Fuller et al. (1996), and Ausubel et al. (1989).

[0244] The preferred bacterial vectors according to the invention are for example the vectors pBR 322 (ATCC N^(o) 37017) or the vectors such as pAA223-3 (Pharmacia Uppsala, Sweden) and pGEM1 (Promega Biotech, Madison, WU, USA) and pUC19 (marketed by Boehringer Mannheim, Germany).

[0245] Other commercially available vectors include the vectors pQE70, pQE60, pQE9 (Qiagen, psuX 174, pBluescript SA, pNH8A, pMH16A, pMH18A, pMH46A, pWLNEO, pSG2CAT, pOG44, pXT1, pSG (Stratagene).

[0246] They may also be vectors of the baculovirus type such as the vector pVL1392/1393 (Pharmingen) used to transfect the cells of the line Sf9 (ATC N^(o) CRL 1711) derived from Spodoplera frugidera.

[0247] It is preferable to use vectors especially suitable for the expression of the sequence of interest in plant cells, such as the following vectors:

[0248] vector pBIN19 (Bevan et al., Nucleic Acids Research, Vol. 12: 8711-8721, marketed by the Company Clontech, Palo Alto, Calif., USA);

[0249] vector pBI101 (Jefferson 1987, Plant Molecular Biology Reporter, vol. 5 : 387-405, marketed by the Company Clontech);

[0250] vector pBI121 (Jefferson et al. 1987, Plant Molecular Biology Reporter, vol. 5 : 387-405, marketed by the Company Clontech);

[0251] vector pEGFP (Cormack BP et al., 1996, marketed by the Company Clontech);

[0252] vectors SLJ75515 and SLJ75516 (gift from Dr. J. Jones, The Sainsbury Laboratory, Norwich, UK);

[0253] vectors pDHB321 (gift from Dr. Bouchez, INRA Versailles, France);

[0254] vectors pAOV, pOV2, pSOV, pSOV2, pkMB and pSMB (Mylne et al., 1996).

[0255] In order that they can express the polynucleotides according to the invention, these vectors must be introduced into a host cell. The introduction of the polynucleotides according to the invention into a host cell may be performed in vitro, according to techniques well known to a person skilled in the art for transforming or transfecting the cells, either in primary culture, or in the form of cell lines.

[0256] A further object of the invention is a host cell transformed with a nucleic acid or by a recombinant vector according to the invention.

[0257] Such a transformed host cell is preferably of prokaryotic or eukaryotic origin, especially bacterial, fungal or plant.

[0258] Bacterial cells which may particularly be used are from different strains of E. coli or of Agrobacterium tumefaciens.

[0259] The transformed host cell is preferably a plant cell or a plant protoplast.

[0260] Even more preferably, it is a cell or a protoplast of rape, tobacco, corn, tomato, potato or of Arabidopsis thaliana.

[0261] The invention also relates to a transformed multicellular plant organism, characterized in that it comprises a transformed host cell or a multiplicity of host cells transformed with a nucleic acid of the RRS1-R or RRS1-S gene or with a recombinant vector according to the invention.

[0262] According to a first embodiment, the multicellular plant organism is transformed with one or more antisense nucleotides and/or one or more homopurine or homopyrimidine polynucleotides so as to inhibit or block the expression of the RRS1-S gene in this organism.

[0263] According to a second embodiment, the multicellular plant organism is transformed with one or more copies of a polynucleotide coding for the RRS1-R protein or for a polypeptide having at least 40% identity in amino acids with the RRS1-R polypeptide and retaining its biological activity, enabling it to provide a phenotype of resistance to R. solanacearum to the transformed organism.

[0264] A further object of the invention is a transgenic plant, in other words a transformed plant containing, preferably in a form integrated into its genome, a nucleic acid of the RRS1-R or RRS1-S gene and preferably an antisense polynucleotide or a homopurine or homopyrimidine polynucleotide or a nucleic acid coding for the RRS1-S polypeptide or a homologous polypeptide, said nucleic acid having been inserted into the genome of the plant by transformation with a nucleic acid of RRS1-R or RRS1-S or a recombinant vector according to the invention.

[0265] A transformed plant according to the invention is preferably a rape, tobacco, corn, tomato, potato or Arabidopsis thaliana plant.

[0266] According to a first embodiment, the transgenic plants such as defined above have a reduced, undetectable or absence of expression of the RRS1-S gene and are thus able to show increased resistance to pathogens such as R. solanacearum.

[0267] According to a second embodiment, the transgenic plants such as defined above have the property of strongly expressing the RRS1-R polypeptide and thus having a phenotype of resistance to various pathogens, and particularly to bacterial pathogens such as R. solanacearum.

[0268] A further object of the invention is a method for obtaining a transgenic plant transformed with a nucleic acid according to the invention, characterized in that it comprises the following steps:

[0269] a) obtaining a plant recombinant host cell such as defined above;

[0270] b) regenerating an entire plant from the transformed plant host cell obtained in step a);

[0271] c) selecting the plants obtained in step b) which have integrated the nucleic acid of the RRS1-R or RRS1-S gene of interest.

[0272] The invention also relates to a method for obtaining a transgenic plant, transformed with a nucleic acid according to the invention, characterized in that it comprises the following steps:

[0273] a) transforming a plant cell with a nucleic acid of the RRS1-R or RRS1-S gene or with a recombinant vector according to the invention;

[0274] b) regenerating an entire plant from cells of transformed plants obtained in step a);

[0275] c) selecting the plants which have integrated the nucleic acid of the RRS1-R or RRS1-S gene of interest.

[0276] The invention also relates to a method for obtaining a transformed plant, characterized in that it comprises the following steps:

[0277] a) obtaining a host cell of Agrobacterium tumefaciens transformed with a nucleic acid or a recombinant vector according to the invention;

[0278] b) transforming the selected plant by infection with the cells of Agrobacterium tumefaciens obtained in step a);

[0279] c) selecting the plants which have integrated the nucleic acid according to the invention.

[0280] Any of the methods described above for obtaining a transgenic plant may also contain the following additional steps:

[0281] d) crossing between themselves two transformed plants such as obtained in step c);

[0282] e) selecting the plants homozygous for the transgene.

[0283] According to another embodiment, any of the methods described above may also contain the following steps:

[0284] f) crossing a transformed plant obtained in step c) with a plant of the same species;

[0285] g) selecting the plants arising from the crossing of step d) which have retained the transgene.

[0286] A person skilled in the art is able to use many methods of the state of the art in order to obtain plants transformed with a nucleic acid of the RRS1-R or RRS1-S gene according to the invention.

[0287] A skilled person in the art may advantageously refer to the technique described by BECHTOLD et al. (1993) in order to transform a plant using the bacterium Agrobacterium tumefaciens.

[0288] The techniques used in other types of vectors may also be used, such as the techniques described by BOUCHEZ et al. (1993) or by HORSCH et al. (1984).

[0289] Another object of the invention is a transformed plant such as obtained by any of the methods for obtaining it described above.

[0290] The invention also relates to a plant seed whose component cells contain a nucleic acid of the RRS1-R gene or of the RRS1-S gene according to the invention which has been artificially inserted in their genome.

[0291] Another object of the invention is a seed of a transgenic plant such as defined above.

[0292] A further object of the invention consists of the use of a nucleic acid of the RRS1-R gene or of the RRS1-S gene according to the invention for in vitro or in vivo expression, preferably in planta of the RRS1-R or RRS1-S protein or of a peptide fragment of this.

[0293] The invention also relates to the use of an antisense nucleic acid, or of a homopurine or homopyrimidine nucleic acid according to the invention to inhibit or block the expression of the gene coding for the RRS1-R or RRS1-S protein.

[0294] The above uses are preferably characterized in that they consist of an in vivo expression in a plant transformed with such a nucleic acid.

[0295] As already stated above, the RRS1-R gene codes for a polypeptide of 1378 amino acids in length.

[0296] In addition, the RRS1-R polypeptide shows structural properties defined above in the specification, i.e. the characteristic presence of functional domains found in combination for the first time in the amino acid sequence of a polypeptide, which provides to the RRS1-R protein the status of being the first member of a new class of proteins defined in the most general way as proteins containing:

[0297] a) an N-terminal portion containing at least one sequence of amino acids rich in leucine and at least one nucleotide binding site; and

[0298] b) a C-terminal portion containing a DNA binding domain containing the amino acid sequence “WRKYGQK”.

[0299] The RRS1-R polypeptide is referenced as the sequence SEQ ID N^(o) 9.

[0300] As already noted, the applicant has shown that the mutations in the amino acid sequence of the RRS1-R polypeptide can lead to a protein devoid of the biological activity of the wild RRS1-R protein, the expression of the mutant protein in the plant no longer enabling the observation of the phenotype of resistance to certain pathogens, such as R. solanacearum. A representative polypeptide of the mutant polypeptides of RRS1-R modified in their biological function is the RRS1-S polypeptide whose amino acid sequence is referenced as the sequence SEQ ID N^(o) 10.

[0301] According to another embodiment, the invention also relates to a polypeptide coded by a nucleic acid of the RRS1-R gene or of the RRS1-S gene, and preferably a polypeptide containing at least 5 consecutive amino acids of a protein selected from RRS1-R and RRS1-S.

[0302] Such a polypeptide preferably contains at least 10, 15, 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400, 450, 500, 600, 700, 1 200 consecutive amino acids of the RRS1-R polypeptide of amino acid sequence SEQ ID N^(o) 9 or of the RRS1-S polypeptide of amino acid sequence SEQ ID N^(o) 10.

[0303] The invention also relates to a polypeptide containing amino acid sequences having at least 40% identity in amino acids with the sequence of the RRS1-R polypeptide SEQ ID N^(o) 9, with the sequence of the RRS1-S polypeptide SEQ ID N^(o) 10, or of a peptide fragment of these.

[0304] Advantageously included in the invention is a polypeptide having at least 60%, 80%, 85%, 90%, 95% or 99% identity in amino acids with the sequence of the RRS1-R polypeptide of SEQ ID N^(o) 9, with the sequence of the RRS1-S polypeptide SEQ ID N^(o) 10, or of a peptide fragment of these.

[0305] In general, the polypeptides according to the invention are in an isolated or purified form.

[0306] The invention also relates to a method for producing the RRS1-R polypeptide of sequence SEQ ID N^(o) 9, the RRS1-S polypeptide of sequence SEQ ID N^(o) 10 or a peptide fragment of these, the method comprising the steps of:

[0307] a) inserting a nucleic acid coding for the RRS1-R, RRS1-S polypeptide or a peptide fragment of these, into an appropriate vector;

[0308] b) culturing, in an appropriate culture medium, a host cell previously transformed or transfected with the recombinant vector of step a);

[0309] c) recovering the transformed host cell from the treated or lysed culture medium, for example by sonication or osmotic shock;

[0310] d) separating and purifying said polypeptide from the culture medium or cell lysates obtained in step c);

[0311] e) optionally, characterizing the recombinant polypeptide produced.

[0312] The peptides according to the invention may be characterized by fixation on an immunoaffinity chromatographic column on which antibodies directed against these polypeptides or a fragment or a variant of these polypeptides have been previously immobilized.

[0313] According to another embodiment, a recombinant polypeptide according to the invention may be purified by passage through a chromatography column according to methods known to a person skilled in the art and described for example by AUSUBEL F. et al. (1989) cited above.

[0314] A polypeptide according to the invention may also be prepared by conventional chemical synthesis techniques either in homogeneous solution or in the solid phase.

[0315] As an illustrative example, a polypeptide according to the invention 10 may be prepared by the homogenous solution technique described by HOUBEN WEYL (1974) or by the solid phase synthesis technique described by MERRIFIELD (1965a, 1965b).

[0316] Polypeptides referred to as homologues of the RRS1-R or RRS1-S polypeptides, or of their fragments, are also in the scope of the invention.

[0317] Such homologous polypeptides have amino acid sequences having one or more substitutions of an amino acid by an equivalent amino acid, compared to the reference polypeptide.

[0318] In the context of the present invention, equivalent amino acids should be understood to mean for example the replacement of a residue in the L form by a residue in the D form, or the replacement of a glutamic acid (E) by a pyro-glutamic acid according to techniques well known to a person skilled in the art.

[0319] As an illustration, the synthesis of peptides containing at least one residue in the D form is described by KOCH et al. (1977).

[0320] According to another embodiment, other examples of equivalent amino acids are two amino acids belonging to the same class, in other words two acid, basic, non-polar or polar uncharged amino acids.

[0321] Also in the scope of the invention is a polypeptide containing amino acid modifications of 1, 2, 3, 4, 5, 10 to 20 substitutions, additions or deletions of an amino acid compared to the amino acid sequence of the RRS1-R polypeptide or of the RRS1-S polypeptide according to the invention.

[0322] The polypeptides according to the invention preferably containing one or more additions, deletions, substitutions of at least one amino acid retain their ability to induce the resistance to various pathogens, and particularly to R. solanacearum.

[0323] According to another preferred embodiment, the polypeptides according to the invention containing one or more additions, deletions, substitutions of at least one amino acid retain their capacity to be recognized by antibodies directed against the unmodified RRS1-R or RRS1-S polypeptides.

[0324] A polypeptide derived from the RRS1-R protein or from the RRS1-S protein is particularly useful for the preparation of antibodies intended for the detection of the presence of one or the other of these polypeptides or of a peptide fragment of them in a sample.

[0325] In addition to the detection of the presence of the RRS1-R or RRS1-S polypeptide or of a peptide fragment of them in a sample, antibodies directed against these polypeptides are used to quantify the synthesis of RRS1-R or RRS1-S, for example in plant cells, and thus to determine the capacity of this plant to resist certain pathogens, particularly R. solanacearum.

[0326] The preferred antibodies according to the invention are antibodies specifically recognizing the amino acid sequence from the amino acid in position 704 to the amino acid in position 712 of the sequence of the RRS1-R polypeptide of SEQ ID N^(o) 9.

[0327] A second class of preferred antibodies according to the invention is the antibodies specifically recognizing the amino acid sequence from the amino acid in position 704 to the amino acid in position 710 of the sequence of the RRS1-S polypeptide of SEQ ID N^(o) 10.

[0328] A third class of preferred antibodies according to the invention is the antibodies specifically recognizing the amino acid sequence from the amino acid in position 1291 to the amino acid in position 1378 of the RRS1-R polypeptide of sequence SEQ ID N^(o) 9.

[0329] By “antibody” in the context of the present invention should be in particular understood polyclonal or monoclonal antibodies or their fragments (for example the fragments F(ab)′₂, F(ab)) or any polypeptide containing an initial antibody domain recognizing the target polypeptide or polypeptide fragment according to the invention.

[0330] The monoclonal antibodies may be prepared from hybridomas according to the technique described by KOHLER and MILSTEIN (1975).

[0331] The present invention also relates to antibodies directed against a polypeptide such as described above or a fragment or a variant of this, such as produced in the trioma technique or the hybridoma technique described by KOZBOR et al. (1983).

[0332] The invention also relates to single chain antibody Fv (ScFv) such as described in the U.S. Pat. No. 4,946,768 or by MARTINEAU et al. (1998).

[0333] The antibodies according to the invention also include the antibody fragments obtained using phage banks (RIDDER et al., 1995), REINMANN K. A. et al., 1997).

[0334] The antibody preparations according to the invention are useful in immunological detection tests intended for the identification of the presence and/or the quantity of the RRS1-R or RRS1-S protein or a peptide fragment of one of these proteins in a sample.

[0335] An antibody according to the invention may additionally contain a detectable isotopic or non-isotopic, for example fluorescent, marker, or may be coupled with a molecule such as biotin, according to techniques well known to a person skilled in the art.

[0336] A further object of the invention is thus a method for detecting the presence of a RRS1-R polypeptide, a RRS1-S polypeptide or a peptide fragment of one of these polypeptides according to the invention, in a sample, said method comprising the steps of:

[0337] a) placing the sample to be tested in contact with an antibody such as defined above;

[0338] b) detecting the antigen/antibody complex formed.

[0339] The invention also relates to a diagnostic pack or kit for detecting the presence of a polypeptide according to the invention in a sample, said kit comprising:

[0340] a) an antibody such as defined above;

[0341] b) optionally, a reagent allowing the detection of the antigen/antibody complexes formed.

[0342] The invention also relates to fusion proteins containing an amino acid sequence of one of the two C-terminal or N-terminal functional domains of a polypeptide according to the invention, on the one hand, and on the other hand all or part of the amino acid sequence of a heterologous polypeptide.

[0343] In the context of the present specification, “heterologous polypeptide” should be understood as an amino acid sequence which is not naturally present in the amino acid sequence of the polypeptide or of the N-terminal or C-terminal fragment of the polypeptide according to the invention with which this “heterologous” amino acid sequence is covalently bonded, preferably by a peptide bond.

[0344] According to a first embodiment, a fusion protein or chimera according to the invention contains (1) the N-terminal domain of a first polypeptide belonging to the new class of proteins of resistance to plant pathogens according to the invention, covalently bonded, preferably by a normal peptide bond, to a second polypeptide containing the C-terminal domain of a second protein also belonging to the class of proteins of resistance to plant pathogens according to the invention.

[0345] The N-terminal domain of the first polypeptide and the C-terminal domain of the second polypeptide may be directly bonded to each other by a peptide bond.

[0346] According to a second embodiment, the N-terminal domain of the first polypeptide and the C-terminal domain of the second polypeptide may be separated, within the fusion protein, by a spacer amino acid sequence which may be of any type.

[0347] Such a chimeric protein thus contains an N-terminal domain containing characteristic units of a protein of resistance to pathogens, covalently bound to a C-terminal domain containing a nucleotide binding site likely to have a different specificity to that of the C-terminal domain naturally found within the amino acid sequence of the natural polypeptide from which the N-terminal domain of the chimeric protein originates.

[0348] Such chimeric proteins constitute new means enabling, by suitable choice of the N-terminal domain, a resistance to a given pathogen to be provided to a plant.

[0349] In addition, the choice of the C-terminal domain for binding to nucleotides allows a person skilled in the art to target specifically the regulatory sequences which are activated by such a chimeric protein and thus to control both the type and level of resistance desired.

[0350] According to a second embodiment, a fusion protein according to the invention consists of (1) the amino acid sequence of all or part of a protein of which the mechanisms of induction in a plant are known and (2) (i) either the complete amino acid sequence of a polypeptide of resistance to pathogens according to the invention, or (ii) the amino acid sequence of the C-terminal domain containing a nucleotide binding site of a polypeptide according to the invention.

[0351] The synthesis of such a protein is preferably inducible by stress signals of the plant, such as a stress due to a sudden change in the temperature of the environment.

[0352] The invention also relates to a nucleic acid coding for a chimeric protein such as defined above.

[0353] According to a preferred embodiment, such a nucleic acid contains, close to and advantageously upstream of the coding region, a constitutive or inducible promoter.

[0354] As an illustration, such a nucleic acid contains, from the 5′ end to the 3′ end, a plant heat shock promoter protein, (2) a nucleotide sequence coding for the N-terminal portion of the heat shock protein whose synthesis is naturally regulated by the promoter (1), (3) the nucleotide sequence coding a polypeptide of resistance to plant pathogens which has retained its transcription factor function (WRKY functional domain) according to the invention.

[0355] As an example, the preferred promoters are those which are inducible by:

[0356] Heat (Athsp 17.6; Prandl et al., 1995);

[0357] H₂O₂, auxin, salicylic acid (GST6; Chen et al., 1996);

[0358] salicylic acid (PR1; Lebel et al., 1998).

[0359] A chimeric protein according to the invention may be obtained according to the technique described in example 7.

[0360] As has been described above, a polypeptide of resistance to plant pathogens according to the invention, has, both in the N-terminal domain and in the C-terminal domain, characteristic units of binding to proteins.

[0361] Proteins interacting with a polypeptide according to the invention thus also represent means involved in the resistance of plants to infection by certain pathogens, particularly by R. solanacearum.

[0362] An additional object of the invention is methods and means designed for the screening of substances or molecules which may be candidates for binding with a polypeptide according to the invention.

[0363] The invention thus relates to a method for screening a candidate substance fixing to a polypeptide according to the invention, and more particularly a RRS1-R or RRS1-S polypeptide, characterized in that it comprises the steps of:

[0364] a) preparing a polypeptide according to the invention;

[0365] b) obtaining a candidate substance to be tested;

[0366] c) placing the polypeptide of step a) in contact with the candidate substance of step b);

[0367] d) detecting any complex formed between the polypeptide of the invention and the candidate substance.

[0368] As an illustration, in a screening method as defined above, a polypeptide according to the invention is immobilized by adsorption or by covalent bonding on an appropriate surface, the surface of a microtitration plate well. The immobilized polypeptide of the invention is then placed in contact with the candidate substance or molecule to be tested.

[0369] The detection of the fixation of the candidate molecule on the immobilized polypeptide of the invention may then be performed for example with an antibody specifically recognizing the candidate molecule to be tested, said antibody containing optionally a detectable marker.

[0370] According to another embodiment of a screening method such as defined above, the candidate substance or molecule is previously immobilized on the surface of a solid phase before being brought into contact with a polypeptide according to the invention.

[0371] The detection of any complex formed between the polypeptide according to the invention and the candidate molecule immobilized on the solid phase may then be performed using an antibody specifically recognizing a polypeptide according to the invention, said antibody optionally being marked.

[0372] To perform such a screening method, a person skilled in the art may advantageously refer to the use of immunodetection techniques of the ELISA type.

[0373] The invention also relates to a kit or pack for screening a candidate substance fixing to a polypeptide of the invention, such kit or pack comprising:

[0374] a) a polypeptide according to the invention;

[0375] b) optionally, the reagents necessary for the detection of the complexes formed between the polypeptide of the invention and the candidate substance to be tested.

[0376] A method for screening molecules interacting with a polypeptide according to the invention may also be performed according to double hybrid techniques. Double hybrid screening techniques are used to study protein-protein interactions in vivo and are based on the fusion of a “bait” protein with the DNA binding site of the yeast protein Gal4.

[0377] The double hybrid technique is particularly described in the U.S. Pat. Nos. 5,667,973 and 5,283,173, the technical teaching of these two patents being incorporated herein by reference.

[0378] The screening of bands of cDNA containing inserts coding for translation products potentially interacting with a polypeptide according to the invention or a peptide fragment containing a domain binding with proteins of a polypeptide according to the invention may be performed as described by HARPER et al. (1993), by CHO et al. (1998), or by FROMONT-RACINE et al. (1997).

[0379] The invention also relates to a method for screening a candidate substance fixing to a polypeptide according to the invention, and more particularly to the RRS1-R or RRS1-S polypeptide, said method comprising the steps of:

[0380] a) obtaining a first nucleic acid coding for a fusion protein containing a part of the polypeptide of interest fused to the DNA binding site of a transcription factor protein such as Gal4;

[0381] b) obtaining a second nucleic acid coding for a fusion protein containing the candidate substance fused to the transcription domain of a transcription factor protein such as Gal4;

[0382] c) producing a nucleic acid containing a nucleotide sequence coding for a detectable marker, placed under the control of a regulatory sequence recognized by the transcription factor protein such as Gal4.

[0383] The nucleic acids a), b) and c) being inserted into appropriate vectors, and d) co-transfected with yeast simultaneously with said vectors;

[0384] e) detecting the expression of the nucleotide sequence coding for the detectable marker.

[0385] The screening method above may additionally include a step f) during which the nucleotide sequence and/or the amino acid sequence of the “prey” nucleic acid inducing the expression of the detectable marker is characterized.

[0386] The invention also relates to a kit or pack for screening a candidate substance fixing to a polypeptide according to the invention, and more particularly the RRS1-R or RRS1-S polypeptide, said kit or pack comprising:

[0387] a) a first nucleic acid coding for a fusion protein containing a portion of the polypeptide of interest fused to the DNA binding domain of a transcription factor such as Gal4;

[0388] b) optionally, a second nucleic acid containing a nucleotide sequence coding for a detectable marker, placed under the control of a regulatory sequence recognized by the transcription factor such as Gal4;

[0389] c) a third nucleic acid coding for a fusion protein containing the candidate substance fused to the transcription domain of a transcription factor such as Gal4.

[0390] The invention also relates to a substance able to fix to a polypeptide of the invention, preferably to the RRS1-R or RRS1-S polypeptide, this substance being characterized in that it is able to be obtained by a screening method such as that defined above.

[0391] A screening method making use of a system of the double hybrid type may be readily performed by a person skilled in the art, particularly according to the technique of example 8.

[0392] As already described above, a polypeptide according to the invention contains several nucleotide binding domains both in its N-terminal portion and in its C-terminal portion. The nucleotide sequences to which these different nucleotide binding sites bind thus assume a critical importance in the mechanisms of resistance of plants to different pathogens, and more particularly to R. solanacearum.

[0393] The isolation and characterization of the polypeptides of resistance to plant pathogens according to the invention now allow a skilled person in the art to determine these nucleotide sequences involved in the induction of a phenotype of resistance to pathogens in a plant.

[0394] The invention also relates to a method of screening a nucleic acid interacting with a polypeptide according to the invention, said screening method comprising the steps of:

[0395] a) obtaining a statistical population of nucleic acids of 20 to 50 nucleotides in length;

[0396] b) placing the statistical population of nucleic acids of step a) in contact with a polypeptide according to the invention;

[0397] c) characterizing the nucleic acid or acids interacting with said polypeptide.

[0398] According to such a method, a population of nucleic acids is first prepared, for example by direct chemical synthesis, which are from 20 to 50 nucleotides in length, and preferably from 20 to 30 nucleotides in length, whose nucleotide sequence is statistical, each of the synthetic nucleic acids being additionally covalently bound, at their 5′ or 3′ end, to a unique known nucleotide sequence.

[0399] The nucleic acids forming a statistical population of nucleic acids described above are then placed in contact with a polypeptide according to the invention, said polypeptide optionally being previously immobilized on a solid phase.

[0400] After a step of removing the nucleic acids which have not interacted with the polypeptide according to the invention, the polypeptide/nucleic acid complexes are placed in contact with an oligonucleotide primer specifically hybridizing, under given hybridization conditions, with the unique known nucleotide sequence common to all the nucleic acids of the statistical population above, then follows at least one amplification cycle using the primer hybridizing with the unique known nucleotide sequence of the nucleic acid retained by the polypeptide of the invention, for example an amplification by PCR.

[0401] According to a particular embodiment of this screening method, the nucleic acids thus amplified and isolated are again placed in contact with a polypeptide according to the invention, then the nucleic acid or acids which have formed a complex with said polypeptide are again hybridized with the nucleotide primer hybridizing with the unique known nucleotide sequence and a new cycle or cycles of amplification are performed.

[0402] The screening method such as defined above preferably includes from 1 to 10, and more preferably from 1 to 5 cycles of contact/amplification of the nucleic acids fixed onto the polypeptide of the invention.

[0403] The nucleic acids selected for their specific fixation to a polypeptide according to the invention are then characterized, preferably by sequencing.

[0404] The invention also relates a kit or pack for screening a nucleic acid interacting with a polypeptide according to the invention, comprising:

[0405] a) a polypeptide according to the invention;

[0406] b) optionally, a statistical population of nucleic acids of 20 to 50 nucleotides in length.

[0407] The invention is further illustrated, without in any way being limited, by the figures and examples.

[0408]FIG. 1 illustrates the amino acid sequence of the RRS1-R polypeptide. The different characteristic domains are represented.

[0409]FIG. 2 illustrates the amino acid sequence of the RRS1-S polypeptide. The different characteristic domains are represented.

[0410]FIG. 3 illustrates the representation and location of the clones YACS, BACS and cosmids covering the genomic region of interest of A. thaliana.

MATERIALS AND METHODS

[0411] a. Analysis by Southern Blot

[0412] The genomic DNA was isolated from leaves of plants and plantlets of Arabidopsis according to the technique described by Deslandes et al. (1998). The experiments of the Southern type were performed as described by Sambrook et al. (1989) using 2 μg of genomic DNA.

[0413] b. Extraction of the DNA from clones BAC, YAC, TAC, Cosmids, and from A. tumefaciens

[0414] The extraction of DNA from clones BAC and TAC was performed using a modification of the protocol supplied by the ABRC. 25 mL of a 12 hour culture of a BAC clone in the appropriate antibiotic were centrifuged for 5 min at 6 000 rpm. The sediment was resuspended in 4 mL of a buffer containing: 50 mM Glucose, 10 mM EDTA, 25 mM Tris, pH 8.0, 5 mg/mL of lyzozyme. After 5 min of incubation at ambient temperature, the cells were lysed by addition of 8 mL of 0.2 N NaOH, 1% SDS. A further incubation of 5 min at ambient temperature was followed by the addition of 6 mL potassium acetate 3M, pH 4.8. The mixture was incubated at 0° C. for 10 min., then centrifuged 15 min at 13 000 rpm. The supernatant was recovered, precipitated by addition of 0.7 volume of isopropanol, then centrifuged 15 min at 13 000 rpm. The sediment obtained was washed with ethanol 70%, dried 15 min at ambient temperature then resuspended in 0.6 mL of TE buffer (10 mM tris, pH 8, 1 mM EDTA).

[0415] The purification of the cosmid DNA was performed according to the protocol of Sambrook et al. (1989).

[0416] The extraction of the DNA from A. tumefaciens was performed following a protocol developed in the laboratory. 2 mL of a 12 hour culture in L medium supplemented with the appropriate antibiotics were centrifuged for 4 min at 13 000 rpm. The sediment was resuspended in 700 μl of buffer CTAB (100 mM Tris HCl, pH 8, 1.4M NaCl, 20 mM EDTA, pH 8, 2% w/v CTAB (hexadecyltrimethylammonium bromide) to which was added extemporaneously 0.07N of B-mercaptoethanol. The mixture was incubated for 10 min at 65° C., then extracted twice with an equal volume of phenol/chloroform (1V/1V) followed by an extraction with chloroform. The aqueous phase was then precipitated by 0.7 volume of isopropanol, the mixture centrifuged 10 min at 13 000 rpm and the sediment obtained was then resuspended in 50 μL of distilled water.

[0417] c. Plant Material and Protocol of Inoculation of Arabidopsis by R. solanacearum

[0418] The growth of Arabidopsis plants and the conditions of inoculation of these plants by different strains of R. solanacearum have been described by Deslandes et al. (1998).

[0419] d. Analysis of the DNA Sequences

[0420] The BLAST research programme (Altschul et al. 1990, 1997) was used for sequence analysis and comparison in the databases of Genbank, EMBL and Swissprot and in the databases on Arabidopsis (AatDB Database Arabidopsis).

EXAMPLE 1 Positional Cloning of RRS1

[0421] This cloning was performed on the sensitive ecotype Col-5 which contained the allele RRS1. The resistance to the strain GMI1000 of Ralstonia solanacearum was provided by the allele rrs1 of the resistant ecotype Nd-1.

[0422] A series of overlapping clones YACs (Yeast Artificial Chromosome) constructed by the Japanese consortium in charge of the programme of sequencing the chromosome V of Arabidopsis thaliana (http://www.kazusa.orjp/arabi/chr5/pmap/P1_map_(—)16.html) was obtained from the ABRC (Arabidopsis Biological Resource Center, Ohio State University, USA). RFLP markers with a polymorphism between the parent ecotypes Col-5 and Nd-1 were generated from the left and right ends of inserts of the YACs clones following the protocol of Schmidt et al. (1996). These different markers, EW7G12LE (LE, Left End: generated from the left end of the insert of the YAC), 11H2RE (RE, Right End: generated from the right end of the insert of the YAC), 11H2LE, 4H11RE as well as a marker T43968 (clone of cDNA positioned on the YAC CIC4E12 and obtained from the ABRC) were used as RFLP markers so as to localize more precisely the locus RRS1. Two of these markers (T43968 and EW7G12LE) were shown to be polymorphic with the parent ecotypes and allowed restriction of the region containing the RRS1 locus to a region of about 600 kb. On the 18 remaining recombinant plants, only 11 lines still showed a recombination event between these 2 markers and the RRS1 locus.

[0423] In addition, the markers generated from the ends of the YACs clones were used to perform the screening of a TAMU bank (Texas A & M University) of BACs (Bacterial Artificial Chromosome), a bank obtained through the ABRC. This screening, performed following conditions given by the ABRC, led to the isolation of a substantial number of BACs clones. Markers were then generated from the ends of some of these BACs by of “plasmid rescue” and inverse PCR techniques (Woo et al., 1994). These markers, 1H19LE, 2G14LE, 21F10LE, 9N23LE, 27P17LE and 29D23L E were used to position the different selected BACs with respect to each other (by digestion of the DNA of the different BACs and hybridization with each of the isolated ends). These data could be used to construct a series of overlapping BACs clones. Finally, we made use of the existence of a series of overlapping TACs (Transformation Artificial Chromosome) clones generated by the Japanese consortium. The corresponding TACs clones (K9E15, K18C1, K15122, K2N11) and the MFC19 clone (bank P1, Liu et al., 1995) were obtained by the ABRC.

[0424] In parallel with these studies, the number of lines showing a recombination event being relatively low, 650 F2 plants originating from a cross between the 2 parent ecotypes were tested so as to characterize other lines showing such an event between the T43968 and EW7G12LE markers and the RRS1 locus. With this aim, the DNA of several leaves of each F2 plant was extracted (Deslandes et al., 1998) and digested by the restriction enzyme BglII which allowed observation of a polymorphism with the T43968 and EW7G12LE markers on the 2 parental genotypes. The F2 plants thus selected were then self-fertilized and the seeds obtained (F3) were sown. The phenotype of these third generation plants was then determined by inoculation of the strain GMI1000. This approach enabled recombinant 15 F3 families between the T43968 and EW7G12LE markers to be obtained.

[0425] The markers corresponding to the ends of BACs were then used as RFLP markers on the 11 RILs lines and on the 15 F3 families. The sub-cloning of BACs and of cosmids by different restriction enzymes moreover allowed generation of other RFLP markers (Table 5). TABLE RFLP Markers used to map the RRSI locus over 15 F3 families originating from the Col-5 and Nd-1 crossing. Markers 18 32 149 195 251 324 328 332 364 380 401 428 446 540 566 T43968 AB A AB A AB AB A AB A AB B AB A AB AB 1H19 n° 1 AB AB A AB AB A AB AB AB B AB AB AB AB #419 AB A A AB AB A AB AB AB B AB AB AB AB B1 B AB A — AB AB A AB AB AB B AB AB AB AB B1 n° 4 — AB A A AB AB A AB AB — B AB AB AB AB #6.1 — AB A A AB AB A AB AB AB B AB AB AB AB RRS1 B AB A — AB AB A AB AB AB B AB AB AB A MFC5 — — — — — AB/A AB/A — — AB AB — — — — MFC61 B AB A A — A AB B — AB AB — — B — MFC14 — — — — — A AB/B — — AB/B — — — — A 9N23 N° 2 B AB A A AB — AB AB — B AB AB AB AB — 9N23 N° 18 B? AB A — — — — — — — — A — — — 2G14LE B AB A A AB — AB B — B AB — — B — EW7G12LE B AB A AB B B AB B AB B AB A AB B A

[0426] These experiments enabled the RRS1 gene to be localized on 3 overlapping BACs clones (T29K4, T25P9 and MFC19), covering a region of about 170 kb (FIG. 3).

[0427] Contigs of cosmids covering the whole of the 2 BACs were obtained by partial digestion by the BamHI enzyme and sub-cloning of the DNA of the 2 BACs in the cosmid vector SLJ75515 (gift from Dr. J. Jones, The Sainsbury Laboratory). The use of these cosmids and of sub-clones of these cosmids obtained by digestion with the enzyme HindIII and insertion into the vector pBlueScript enabled RRS1 to be localized on a cosmid of 18 Kb, the cosmid B1 derived from the clone BAC T29K4. A bank of cDNA of the ecotype Col-0 prepared in the vector IZAP (gift from B. Lescure of our Institute) was then screened with the DNA of the insert of the cosmid B1 and enabled isolation of several clones of cDNA whose nucleotide sequence was determined. At the same time, the nucleotide sequence of a TAC clone, K9E15 which covers the cosmid B1, was added to the data banks by the Japanese consortium. Some cDNA clones whose nucleotide sequence was known could thus be localized on the cosmid B1. One of these, named #6.1, used as RFLP markers as well as a CAPS marker, B69.1, enabled delimitation of a region of about 10 kb of the cosmid B1 containing the RRS1 locus. Given the sequence of the cosmid B1, it was possible to generate several pairs of primers so as to identify the PCR markers of the CAPS type. Two recombinant F3 plants, 324 and 566, showing a recombination event within the cosmid B1, allowed identification of the RRS1 gene.

EXAMPLE 2 Isolation and Characterization of RRS1

[0428] A bank of cosmids prepared from the DNA of the ecotype Nd-1 was constructed in the vector SLJ75515 by the group of J. Beynon (Horticultural Research International, Wellesbourne, GB). This bank was kindly given to us and its screening was performed (according to the hybridization conditions of the Southern type described by Deslandes et al., 1998) using the DNA of the clone T1H19 which contains the RRS1 locus. Several cosmid clones were isolated and characterized. One of these, the clone H, was selected since it showed a restriction profile identical with that of B1, the cosmid on which RRS1-S had been localized by positional cloning. The sequence of the RRS1-S gene being known, it was possible to generate oligonucleotide primers which enabled the nucleotide sequence of the region corresponding to RRS1 of the cosmid H to be determined. The determination of the nucleotide sequence was performed on a Perkin-Elmer 373 sequencer (kit Dye Terminator) and the different sequences were assembled using the “Staden” programs for sequence assembly, available on a UNIX station (Roger Staden, MRC, Cambridge, UK). The different primers used were the primers of the “K” series, i.e. all the primers whose names begin with the letter “K” (see Table 6).

EXAMPLE 3 Complementation of the Sensitive and Resistant Ecotypes by RRS1-R and RRS1-S

[0429] a. Constructions

[0430] The cosmid vector SLJ 75515 of clones H and B1 has the advantage of being directly transferable to Agrobacterium tumefaciens. This transfer of cosmids of E. coli (strain XL1Blue for the cosmid B1 and strain DH12S for the cosmid H) into A. tumefaciens (strain GV3 101 containing the helper plasmid pMP90) was performed by triparental conjugation using a third strain, pRK2013 (E. coli) following the protocol of Tolmasky et al. (1984). The colonies of A. tumefaciens containing the cosmids H or B1 were selected on an L agar medium supplemented with tetracycline (10 μg/ml) and gentamycin (25 μg/ml). The dishes were then incubated at 28° C. for 3 days.

[0431] The presence of the cosmid DNA was checked by PCR amplification using specific primers of inserts of the clones H and B1 (pair RT1/RT2 for example). In addition, experiments of the Southern type enabled the stability of the cosmid DNA in A. tumefaciens to be verified.

[0432] The insert of the cosmid B1 was sub-cloned using the enzyme BamH1 which generated two fragments: 9.68 and 8.03 kb. For the cosmid H, 2 BamHI fragments of comparable size to those of the cosmid B1 were obtained. The BamHI site was only present once in the genomic sequences of RRS1-R and RRS1-S. Each of the 2 sub-fragments derived from the cosmids B1 and H was introduced into the binary vector pDHB321.1 supplied by Dr. Bouchez (INRA Versailles). After transformation in a strain of E. coli (XL1BlueMR, Stratagene, USA), the plasmid DNA was extracted (Maniatis 1978). The strain of A. tumefaciens GV3101 was then transformed by the heat shock technique (Holsters et al., 1978). The colonies of A. tumefaciens containing the sub-clones of the cosmids B1 and H were then selected on L agar medium containing gentamycin (25 μg/ml) and Kanamycin (50 μg/ml). The dishes were then incubated at 28° C. for 48 h.

[0433] The presence of the plasmid DNA was checked by PCR amplification using specific primers of the 2 sub-fragments of the clones H and B1. In addition, experiments of the Southern type enabled the stability of the plasmid DNA in A. tumefaciens to be verified. These constructions were then introduced into the genome of the plants Col-5 and Nd-1.

[0434] b. Transformation of the Ecotypes Nd-1 and Col-5 by Agrobacterium tumefaciens

[0435] In order to demonstrate the function of the candidate genes, complementation experiments of the sensitive and resistant phenotypes by the cosmids H and B1 were performed. The conventional techniques of transformation of Arabidopsis by A. tumefaciens (Clough and Bent 1999) were used to generate the transgenic plants described in this work. The seeds of plants transformed by A. tumefaciens (generation T1) were planted in pots containing a layer of perlite (about 2 cm) covered with a layer of sand of the same thickness soaked with a solution of the gluphosinate herbicide (trade name: BASTA) at a concentration of 15 mg/L. The pots were then placed for 4 days in the dark at 4° C., then transferred to long-day conditions (16 h of day/8 h of night). After about 10 days, the resistant plants were selected and repotted individually. The T2 progeny of each transgenic T1 plant obtained by self-fertilization was used for the experiments of inoculation by the pathogenic agent.

[0436] c. Results Obtained

[0437] c.1 The RRS1-R gene provided resistance to strain GMI1000 of Ralstonia solanacearum.

[0438] Complementation experiments performed using the cosmid H containing the RRS1-R allele introduced into the ecotype Col-5 were carried out. After selection by the herbicide BASTA, 12 transgenic plants could be isolated (lines numbered CH1.1 to CH1.12). These plants were self-fertilized so as to obtain the T2 progeny of each of them and 24 individuals of each transgenic line obtained were inoculated with strain GMI1000 of R. solanacearum. While the control plants (Col-5) had wilted 7 days after inoculation with the pathogen, the transgenic plants containing the RRS1-R allele of Nd-I showed no symptoms of wilting. These results show that the RRS1-R gene can provide resistance to the strain GMI1000 of R. solanacearum in plants with genetic base Col-5. The expression of RRS1-R could lead to a reduction of the bacterial multiplication in the infected plant, which could explain the absence of symptoms.

[0439] c.2 The RRS1-R Gene Provided Resistance to Different Strains of Ralstonia solanacearum.

[0440] Some strains of R. solanacearum induce responses (disease or absence of symptoms) identical to those of the strain GMI1000 in the ecotypes Nd-1 and Col-5. These isolated strains of very diverse plant-hosts (cf. table) originate from different regions of the globe. The capacity of the RRS1-R gene to provide resistance to these different strains was thus tested. Col-5 transgenic plants containing the RRS1-R gene were thus infected by these different strains. While the control plants (Col-5) had wilted 7 days after inoculation with the pathogen, the transgenic plants containing the RRS1-R allele of Nd-1 showed no symptoms of wilting. This result shows that the RRS1-R gene is able to provide resistance to different strains of R. solanacearum.

[0441] c.3 Does the RRS1-S Gene Provide Sensitivity to Strain GMI1000 of R. solanacearum?.

[0442] The response of Nd-1 plants transformed by the RRS1-S gene of the Col-5 sensitive ecotype was tested. 104 Plants were selected for their resistance to BASTA (B1.1 to B1.104) and self-fertilized. These transgenic plants were inoculated by the strain GMI1000. Only one transgenic line (B1.3) developed symptoms of wilting while the other lines showed no symptoms under conditions inducing complete wilting of control Col-5 plants.

EXAMPLE 4 Experiments of Complementation by Genomic Clones RRS1-S and RRS1-R.

[0443] A SphI digestion of the cosmid B1 gave a fragment of 9 393 pb (fragment between base 6557 and base 15950, based on the numbering of the sequence of the clone BAC K9E15 published by the Japanese consortium) covering the whole of the RRS1-S gene (promoter, introns/exons, polyadenylation signal sequence). This SphI fragment was ligated into a plasmid pUC19 digested by the SphI enzyme and dephosphorylated. This construction was then introduced into the strain DH5α of E. coli. This clone was defined as being the genomic clone RRS1 and was named B1puc3.

[0444] The same type of experiment was performed from the cosmid H. The genomic clone obtained was named Hpuc2.

[0445] In parallel, the binary vector pDHB321.1 was digested by the enzyme BamHI (unique cloning site). Ligation of the inserts (SphI/SphI) of the clones B1puc3 and Hpuc2 (derived from a SphI digestion) in the binary vector (BamHI/BamHI) thus required use of an adaptor so as to ligate the BamHI and SphI ends to each other. Two oligonucleotide primers were involved in the synthesis of this adaptor: it was composed of an equimolar mixture (5 μM) of the primers named “BamSph” (5′-GAT CGC GGC CGC CAT G-3′) and “Not” (5′-GCG GCC GC-3′). The mixture was subjected to a temperature of 95° C. for 3 minutes and then allowed to return to ambient temperature naturally.

[0446] To 100 ng of vector pDHB321.1 digested by the enzyme BamHI, 1 μl of the solution of the adaptor of 5 μM concentration was added. The ligation of the linearized vector and of the adaptor took place in a final volume of 10 μl for 12 hours at 4° C. The reaction was then stopped by dilution (addition of 20 μl of water). The excess of adaptor was then removed by passage of the ligation mixture over a microspin S400 column (Pharmacia, ref).

[0447] The DNA of the clones B1puc3 and Hpuc2 was digested by the enzyme SphI. After inactivation of the enzyme at 65° C. for 20 minutes, an aliquot of each digestion (about 100 ng) was used to perform the ligation with the vector pDHB321.1+adaptor. These constructions were then introduced into strain DH5α of E. coli and selected on an L agar medium containing 50 μg/ml of kanamycin (resistance provided by the binary vector): the clones which contained the insert derived from the clones B1 and H (about 9 400 pb) were named B1MT9 and HMTB respectively. The plasmid DNA extracted from these clones was then used to transform strain GV3101 of A. tumefaciens by heat shock. The colonies were selected on an L agar medium containing 25 μg/ml of gentamycin (resistance provided by the helper plasmid pMP90) and 50 μg/ml of kanamycin. The colonies of A. tumefaciens transformed by the DNA of the clones B1MT9 and HMTB were named B1MT9.1 and HMTB2 respectively.

EXAMPLE 5 Expression of the RRS1-S and RRS1-R Genes by Experiments of the Northern Type

[0448] The expression of the RRS1-S and RRS1-R genes was studied in non-infected plants by experiments of the Northern type. After extraction of the total RNA by the technique described by Lummerzheim (1993), the polyadenylated RNAs were purified by use of the Dynabeads Kit (Dynal, USA). After determination at 260 nm, the mRNAs were deposited on gel under denaturing conditions (Marco et al. 1990), then transferred onto a membrane of nitrocellulose (Hybond N+, Amersham Pharmacia Biotech). The hybridization was performed following the protocol given by the supplier.

EXAMPLE 6 Production of Full-Length cDNA Clones by RACE PCR

[0449] The production of full-length cDNA clones required use of a RACE (Rapid Amplification of cDNA Ends) PCR kit. The details of the experimental protocols given below are valid for the clones of cDNA derived from the ecotypes Col-5 and Nd-1.

[0450] a. RACE PCR (Rapid Amplification of cDNA Ends)

[0451] The 5′ and 3′ ends of the cDNA clones were generated by RACE PCR by use of the kit “Smart Race cDNA Amplification” (Clontech, USA). The total RNAs were extracted from two-week-old plantlets of the ecotypes Col-5 and Nd-1 following the protocol of Lummerzheim et al. (1993). The reverse transcriptase reactions were performed on 1 μg of total RNA following the conditions given by the manufacturer. The first strands of the cDNA were synthesized using the following primers:

[0452] 5′CDS and SMART II oligo (Clontech) for obtaining the 5′ ends

[0453] 3′CDS (Clontech) for obtaining the 3′ ends

[0454] The oligonucleotide primers used for the amplification of the 5′ ends of the cDNA RRS1 (Col-5) and rrs1 (Nd-1) were:

[0455] the universal primer UPM (Universal Primer Mix, kit Smart Race)

[0456] the antisense primer RT4, specific for the cDNA RRS1-S/RRS1-R (see table 2).

[0457] The oligonucleotide primers used for the amplification of the 3′ ends of the cDNA RRS1 (Col-5) and rrs1 (Nd-1) were:

[0458] the universal primer UPM (Universal Primer Mix, kit Smart Race)

[0459] the primer RT6, specific for the cDNA RRS1-S/RRS1-R (see table 2).

[0460] The RT4 and RT6 primers were selected so that the amplification products of the 5′ and 3′ ends had in common a region of 500 pb containing a unique BamHI restriction site.

[0461] The conditions of the amplifications of the 5′ and 3′ ends were the following:

[0462] 5 cycles (94° C. for 30 seconds, 72° C. for 5 minutes)

[0463] 5 cycles (94° C. for 30 seconds, 70° C. for 30 seconds, 72° C. for 5 minutes)

[0464] 30 cycles (94° C. for 30 seconds, 68° C. for 30 seconds, 72° C. for 5 minutes)

[0465] The size of the amplification products of the 5′ and 3′ ends were 3 708 pb and 1 230 pb for Col-5; 3 714 and 1 231 pb for Nd-1 respectively. These PCR products were cloned in the vector pGemT-easy (Promega). The sequencing of these clones enabled precise determination of the beginning (5′) and end (3′) of the untranslated transcribed region of the cDNA clones derived from the ecotypes Col-5 and Nd-1. The size of the full-length cDNA clones derived from the plants Col-5 and Nd-i was respectively 4 336 and 4 343 pb.

[0466] b. Production of Full-Size cDNA Clones

[0467] Full-size cDNA clones were generated (i.e. 4 336 and 4 343 pb) by performing a double digestion BamHI (single site) and NotI (site present at the multiple cloning site of the vector pGemT-easy) on a mixture of 2 plasmids with the object of excising the inserts corresponding to the 5′ and 3′ ends. The 5′ and 3′ inserts were then ligated together in the plasmid pGemT-easy digested by NotI and dephosphorylated. After transformation of bacteria DH5α with the ligation mixture and spreading on selective medium (medium L+ampicillin 50 μg/mL+0.5 mM IPTG+80 μg/mL X-Gal), the positive clones containing the full-size cDNA clone were selected by performing a PCR screening on colonies using primers K3.5 and RT2 (see table 6) which are specific to each of the 5′ and 3′ ends respectively.

[0468] A second RACE PCR experiment was performed on the first strands of cDNA produced by the reverse transcriptase reaction described above with the object of introducing two SalI restriction sites, one being located just upstream of the initiation codon ATG (point of initiation of the translation), the other being located after the polyadenylation signal. The two pairs of primers used to amplify the 5′ and 3′ ends were 5′RRSalI/RT8 and 3′RRSalI/RT7 respectively.

[0469] The amplification conditions, the cloning of the PCR products and the method of obtaining the full-length cDNA clones were the same as those described above where the unique restriction site common to the 5′ and 3′ ends was a AflII site.

[0470] The introduction of SalI sites on either side of the sequence of the cDNA clone was in order to facilitate its cloning in different types of vectors.

EXAMPLE 7 Domain Exchange Experiment by Production of Chimeric RRS1-S/RRS1-R Clones

[0471] The plasmid DNA used to perform the domain exchange experiments originated from the genomic clones B1puc3 and Hpuc2 described above. These 2 plasmids were digested by the enzyme BamHI whose restriction site was localized in the region showing similarities with a resistance gene (5th exon). This digestion generated 2 fragments, one of 4 881 pb (clone B1puc3 containing the RRS1-S gene) containing the 5′ portion of the genomic clone (promoter and coding portion of the RRS1-S gene up to the level of the 5th exon) and a fragment of 7 180 pb (clone B1puc3) containing the 3′ portion of the genomic clone (4 512 pb containing the end of the region showing similarities with the resistance gene, the region similar to the gene coding a transcription factor of type WRKY and the 3′ untranslated transcribed region; 2 668 pb of plasmid vector pUC19).

[0472] The size of the fragments generated by the BamHI digestion of the plasmid Hpuc2 was of the same order of magnitude as that of the fragments obtained by BamHI digestion of the plasmid B1puc3.

[0473] After digestion by the BamHI enzyme, an aliquot of each of the DNA fragments generated was dephosphorylated. The residue was deposited on gel so as to purify the fragments corresponding to the 5′ portion of each genomic clone. The fragment of DNA corresponding to the 5′ portion of the RRS1-S gene was then ligated with the 3′ portion of the RRS1-R gene. Similarly, the fragment of DNA corresponding to the 5′ portion of the RRS1-R gene was ligated with the 3′ portion of the RRS1-S gene. These constructions were then introduced into E. coli (strain DH5α) and were respectively named 3BH (5′RRS1-S/3′RRS1-R) and 6HB (5′RRS1-R/3′RRS1-S). Specific primers of the 5′ and 3′ regions of the RRS1-S and RRS1-R genes were used to verify these constructions. The DNA of each plasmid was purified, digested by SphI, an enzyme allowing the excision of the complete genes. The inserts were then ligated into the plasmid vector pDHB321.1 digested by the enzyme BamHI and whose ends had been rendered compatible with the SphI ends using an adaptor (see above). These constructions were successively introduced into strain DH5α (the clones were named 3BHMT7 (5′RRS1-S/3′RRS1-R) and 6HBMT2 (5′RRS1-R/3′RRS1-S)) then into strain GV3101 of A. tumefaciens (clones named 3BHMT7.1 and 6HBMT2.1).

EXAMPLE 8 Search for Proteins Interacting with Proteins rrs1 and RRS1 by the Double Hybrid Technique in Yeast

[0474] In order to identify the proteins interacting with the RRS1-S and RRS1-R proteins, the double hybrid system in yeast was used (kit Clontech, Matchmaker Gal4 Two-Hybrid System 3). The complete cDNA clones corresponding to the RRS1-S and RRS1-R genes and the sub-fragments corresponding to certain specific domains of these genes (domains TIR, NBS, LRR, WRKY) were used to screen cDNA banks with the object of isolating genes whose products interacted with the products of these DNA fragments.

EXAMPLE 9 Modulation of the Expression of the RRS1-S and RRS1-R Genes by Sense and Antisense Approaches

[0475] For the antisense approaches, two different constructions were performed.

[0476] The first consisted of a fragment of 299 pb (position 3 642 to 3 941 of the sequence SEQ ID N^(o) 8) fragment corresponding to the domain WRKY of the protein RRS1-1. PCR primers (B1AS5′ and B1AS3′ which allowed the introduction of a NcoI and SalI site respectively) were generated to amplify this DNA fragment. The amplification was performed on single-strand cDNA resulting from the “reverse transcription” of total RNA of leaves of two-week old Col-5 plants. The polymerase used was Deep Vent (Biolabs, New England) and the conditions of amplification were those described by the manufacturer. The amplification product B1AS5′/B1AS3′ was digested by the enzymes NcoI and SalI and the digestion product deposited on gel and purified (Kit Jetsorb, Quantum Appligene). This DNA fragment was then ligated into the vector PA1 proterm 2 (gift from Pr. Lescure, our Institute) digested by the enzymes NcoI and SalI. This construction was introduced into the strain DH5α of E. coli. The corresponding plasmid DNA was purified, digested by the enzyme BamHI, which allowed the excision of an insert containing the promoter of the gene EF1a, the 299 pb of the RRS1-S gene in antisense orientation and the terminator of the gene EF1a (Axelos et al., 1989). This BamHI fragment was then inserted by ligation into the binary vector pDHB321.1 digested by BamHI. This new construction was introduced into strain DH5α of E. coli. The purified plasmid DNA was used to transform strain GV3101 of A. tumefaciens.

[0477] The second antisense construction was performed by excising an EcoRI fragment of the insert of the clone of full-length RRS1-R cDNA (sequence SEQ ID N^(o) 4). The DNA fragment thus generated (size about 3 930 pb) was introduced into the vector pKMB (Mylne and Botella, 1998) digested by EcoRI. This vector contained the 35S promoter of the cauliflower mosaic virus (CaMV 35S) and the corresponding terminator. This construction was successively introduced into the strains DH5α of E. coli and GV3 101 of A. tumefaciens by triparental conjugation.

[0478] The strains of A. tumefaciens obtained (29B iASMT2: fragment of 299 pb and J1: fragment of 3 930 pb) were used for the transformation of the ecotypes Nd-1 and Col-5. TABLE 6 SEQ ID Oligonucleotides Sequence(5′ --3′) SEQ ID no 11 K0 AGC TCG AGA CTA TTC AGG SEQ ID no 12 K1.3 AAA CAC TGA TAG CTA ACG G SEQ ID no 13 K1.5 ATC TCT AAC GGT GGA TGG SEQ ID no 14 K2.3 TGC ATT CAA GAC CTC TAG G SEQ ID no 15 K2.5 CCG TTA GCT ATC AGT GTT T SEQ ID no 16 K3.3 AGT CAT CAA GTG ACC ATC SEQ ID no 17 K3.5 CTA GAG GTC TTG AAT GCA C SEQ ID no 18 K4 GCA TCA CAG TAG TCC TCG SEQ ID no 19 K5 ACA TCC AAG TCA ATA CCG G SEQ ID no 20 K6 GCC AAT AGA GAT GTA CCA SEQ ID no 21 K7 TGG TAC ATC TCT AAT GGC SEQ ID no 22 K8 AGT AAC ACG TAA TGT AAC C SEQ ID no 23 K9 ACC AGC AAG TTT AGG ATG A SEQ ID no 24 K10 GAT GGT CAC TTG ATG ACT SEQ ID no 25 K11 GGT GTA CAT AAA TCC TTG G SEQ ID no 26 K12 CCA AGG ATT TAT GTA CAC C SEQ ID no 27 K13 ACT CTT ATG GAG ATG CTC SEQ ID no 28 K14 CGC ATC CTT AAA CTA CTG SEQ ID no 29 K15 ATA TCT CCG GTT TCA ACC SEQ ID no 30 K16 CCT TGG TGA GTA GCT CAC SEQ ID no 31 K17 CCA TAG ATC TCC CTC GTC SEQ ID no 32 K18 CCT TAT AGA ACT TCT CTC C SEQ ID no 33 K19 CTC TTC GAG TGC ATC AGG SEQ ID no 34 K20 CCG GTA TTG ACT TGG ATG T SEQ ID no 35 K21 AGA TAC ACG TAC ACT GGC SEQ ID no 36 K22 TCC AGC CCA GAT ATC AGG SEQ ID no 37 K23 TGC ATA GGA AGC TTC TCC SEQ ID no 38 K24 TTC AGA GGA ACT TGA GCG SEQ ID no 39 K25 CCA AGC AAA TAA GCT TCC C SEQ ID no 40 K26 ATC GTC CTC AAC ATC TCC SEQ ID no 41 K27 AGG CGC AGA AGA CTG TGG SEQ ID no 42 K28 TTG ATG CTC CAA GGT TCC SEQ ID no 43 K29 GAG ATG TGT ACG AGA CGC SEQ ID no 44 K30 CAA TCT CCA GCA GCT TCG SEQ ID no 45 K31 TTG AGT GGT TGA ATG TCC SEQ ID no 46 K32 CAC ACG AAT TCC TCA TCC SEQ ID no 47 K33 TGA AGG AAC ACT CGT TGC SEQ ID no 48 K34Nd GTC TTT CAG AGG CCT CGA SEQ ID no 49 K35Col GGT AAG CAA TCT CTG ATA SEQ ID no 50 RT1 ATG TTA TAT CGA CGT TGG SEQ ID no 51 RT2 GAG GAA GTG GAA CGA GTG SEQ ID no 52 RT3 AAC TCC TCC ATG TCC GTC SEQ ID no 53 RT4 ATC TCC CTC GTC TAT AGC CGG TAT GG SEQ ID no 54 RT5 GAT CAG GCT TCC GGG TCC TAG CCA GTC SEQ ID no 55 RT6 AGT GAT GTC AAC ATG CGC CCA AGT ACC SEQ ID no 56 RT7 AAC CTT CAA ACG TGC TCA GGG CTC TG SEQ ID no 57 RT8 ACA TCT CCA GGT TCT TGG TTC CAC CC SEQ ID no 58 5′RRSalI GTC GAC ATG ACC AAT TGT GAA AAG GAT GA SEQ ID no 59 3′RRSalI GTC GAC CTT GTC TTG CAG TGA TGA GAG SEQ ID no 60 B1AS5′ CTA TTC CAT GGA GGA GGA AGT GG SEQ ID no 61 B1AS3′ TTA GTC GAC GAA GAA GAA ACA TAG

BIBLIOGRAPHIC REFERENCES

[0479] AXELOS, M., Bardet, C., Liboz, T., Le Van Thai, A., Curie, C. and Lescure, B. 1989. The gene family encoding the Arabidopsis thaliana translation elongation factor EF-1a: molecular cloning, characterization and expression. Mol. Gen. Genet. 219, 106-112.

[0480] ALSCHUL, S. F., Gish, W, Miller, W., Myers, E. W., and Lipman, D. 1990. Basic local alignment search tool. J. Mol. Biol. 215, 403-410.

[0481] ALSHUL S. F et al., (1997), Nucleic Acids Research, vol. 25: 33896-3402.

[0482] AUSUBEL et al., (1989), Current Protocols in Molecular Biology, Green Publishing Associates and WLILEY Interscience, New-York.

[0483] AUSUBEL et al. Eds., John WILEY and BROWN E. L., (1979), Methods Enzymol vol. 68: 109-151,

[0484] BEAUCAGE et al., (1981), Tetrahedron Lett., vol. 22: 1859-1862

[0485] BOUCHER, C. A., Barberis, P. A., Trigalet, A. P. and Demery, D. A. 1985. Transposon mutagenesis of Pseudomonas solanacearum: isolation of Tn5-induced avirulent mutants. J. Gen. Microbiol. 131, 2449-2457.

[0486] BECHTOLD N., ELLIS J., and PELLETIER G (1993) C.R. Acad. Sci. Pris 316, 1194-1199.

[0487] BROWN E L. et al., (1979) Methods Enzymol., 68: 109-151.

[0488] BEVAN et al., Nucleic Acids Research, 12: 8711-8721.

[0489] BOUCHER et al., 1993.

[0490] CHEN W., CHAO G., SINGH K B (1996). Plant J. 10, 955-966.

[0491] CLOUGH, S., and Bent, A. 1999. Floral dip: a simplified method for Agrobacterium-mediated transformation of Arabidopsis thaliana. Plant J. 16, 735-743.

[0492] CHO R. J. et al. (1998), Proc. Natl. Acad. Sci. USA, Vol.95 (7):3752

[0493] CORMACK B, VALDIVIA R. H. FALKOW S. 1996. Gene 173, 33-38.

[0494] DESLANDES, L., Pileur, F., Liaubet, L., Camut, S., Can, C., Williams, K., holub, E., Beynon, J., Arlat, M. and Marco, Y. 1998. Genetic characterization of RRS1, a recessive locus in Arabidopsis thaliana that confers resistance to the bacterial soilborne pathogen, Ralstonia solanacearum.

[0495] FULLER S. A. et al. (1996), Immunology in Current Protocols in Molecular Biology.

[0496] FROMONT-RACINE M. et al. (1997), Nature Genetics, vol. 16 (3): 277-282.

[0497] GREEN et al., (1986), Ann. Rev. Biochem., vol. 55 : 569-597.

[0498] GRIFFIN et al. (1989), Science, vol. 245 : 967-971.

[0499] HOUBEN WEYL, (1974), In Methoder Organischen Chemie, E. Wunsch Ed. vol.15-1 and 15-II, Thieme, Stuttgart.

[0500] HAMMOND-KOSACK K. E. et al., (1997), Annu. Rev. Plant. Physiol. Plant Mol. Biol., vol. 48: 575-607

[0501] HAMES B. D. and HIGGINS S. J., (1985), Nucleic Acid Hybridization: The Practical Approach, HANES and HIGGINS Ed., IRL Press, Oxford.

[0502] HARPER J. W. et al. (1993), Cell, vol. 75 : 805-816,

[0503] HORSCH et al., (1984).

[0504] IZANT J J, WEINTRAUB H., (1984), Cell, Vol. 36 (4): 1007-1015

[0505] JEFFERSON R. A., KAVANACH, T. A., and BEVAN, M. W. (1987). Embo J. 3901-3907.

[0506] KELLER H., PAMBOUKDJIAN N, PONCHET M, POUPET A, DELON R. VERRIER J L. ROBY D. and RICCI P. (1999). Plant Cell 11, 223-235.

[0507] KOCH Y. (1977), Biochem. Biophys. Res. Commun. Vol. 74: 488-491.

[0508] KOHLER G. and MILSTEIN C., (1975) Nature, vol. 256: 495.

[0509] KOZBOR et al. (1983), Hybridoma, vol. 2 (1): 7-16.

[0510] LIU, Y. G., Mitsukawa, N., Vazquez-Tello, A. and Whittier, F. 1995. Generation of a high quality P1 library of Arabidopsis suitable for chromosome walking. Plant J. 7, 351,358.

[0511] LUMMERZHEIM, M., de Olivera, D., Castresana, C., Miguens F. C.,

[0512] LEBEL E. HEIFETZ P., THORNE L., UKNES S., RYALS, J. AND WARD E. (1998). Plant J. 16, 223-233.

[0513] LOUZADA, E., Roby, D., Van Montagu, M., Timmerman, B. 1993. Identification of compatible and incompatible interactions between Arabidopsis thaliana and Xanthomonas campestris pv campestris and characterization of the hypersensitive response. Mol. Plant Microbe Interact. 6, 532-544.

[0514] MARCO, Y., Raguch, F., Godiard, L. and Froissard, D. 1990. Transcriptional activation of 2 classes of genes during the hypersensitive reaction of tobacco leaves infiltrated with an incompatible strain of the phytopathogenic bacterium, Pseudomonas solanacearum. Plant Mol. Biol. 15, 145-154.

[0515] MYLNE, J., and Botella, J., R. 1998. Binary vectors for sense and antisense expression of Arabidopsis ESTs. Plant Molecular. Biology Reporter 16, p 257-262.

[0516] MARTINEAU et al., JOHNS P. WINTER G. (1998), J. Mol. Biol., vol.280 (1):117-127.

[0517] MERRIFIELD R B, (1965a), Nature, vol. 207 (1996: 522-523)

[0518] MERRIFIELD R. B., (1965b), Science, vol. 150 (693): 178-185.

[0519] McELROY D., ZHANG W, CAO J, WU R (1990) Isolation of an efficient actin promoter for use in rice transformation. Plant Cell 2, p 163-171.

[0520] NARANG S. A. et al., (1979), Methods Enzymol., vol.68, 90-98.

[0521] ODELL JT, NAGY F. CHUA N H. (1985). Identification of DNA sequences required for activity of the cauliflower mosaic virus 35S promoter. Nature 313, p. 810-812.

[0522] PRANDL R., KLOSKE E., SCHOFFL F. (1995). Plant Mol. Biol.28,73-82.

[0523] PEARSON, W. T. and Lipman D. J. (1988). Improved tools for biological sequence comparison. Proc. Natl. Acad. Sci. USA 85, 2444-2448.

[0524] ROSSI et al. (1991), Pharmacol. Ther., vol. 5O: 245-254

[0525] RIDDER R. et al., (1995), Biotechnology (NY), vol.13 (3): 255-260 ou encore des anticorps humanises

[0526] REINMANN K. A. et al., (1997), Aids, Res. Hum. Retroviruses, vol. 13 (11): 933-943

[0527] SCHMIDT, R. West., J. Cnops, G., Lov, K. Balestrazzi, A. Dean C. 1996 Detailed description of four YAC contigs representing 17 Mb of chromosome 4 of Arabidopsis thaliana ecotype Columbia. Plant J. 9, 5755-5765.

[0528] SAMBROOK J., FRITSCH E. F. and T. MANIATIS, (1989), Molecular Cloning: A Laboratory Manual 2nd ed. Cold Spring Harbor Laboratory, Cold Spring Harbor NEW YORK.

[0529] SCZAKIEL G. et al. (1995) Trends. Microbiol., vol. 3 (6): 213-217.

[0530] SANDEZ-PESCADOR R., (1988), J. Clin. Microbiol., vol. 26(10): 1934, 1938.

[0531] TOLMASKY, M., E. and Crosa, J., H. (1984). Molecular cloning and expression of genetic determinants for the iron uptake system mediated by the Vibrio anguillarum plasmid pJM1. J Bacteriol 160, p 860-866.

[0532] URDEA N S et al. (1988), Nucleic Acids Research, vol. 11: 4937-4957.

[0533] URDEA N S et al., (1991), Nucleic Acids Symp. Ser., vol. 24: 197-200.

[0534] WABIKO H. et al., (1986), DBA, vol. 5.(4): 305-314.

[0535] WOO, S.-S, Jiang J., Gill, B. S., Paterson, A. H. and Wing, R. A. 1994. Nuc. Acid Res. 22, p 4922-4931.

[0536]

1 61 1 7936 DNA Arabidopsis thaliana 1 tccatggccc aaaagctttt tcccggtgta cttggacttt tcctcactgg tgtgcgttag 60 agtagcaaga atgtattgtt gactattctt ttctcttttt ttaaatacct ttttgttagt 120 ttttgctgtt ggaaaaatat caaatgtaaa aaaaacactg agatttgagt ggttgaatgt 180 ccaatgaaaa agcatcgtcg gcataaacaa aaattccggc gaaatctaag gttagttcaa 240 agtatcaaag cgcgaaatca tgaccaattg tgaaaaggat gaggaattcg tgtgcatcag 300 ctgcgtagaa gaggtacggt actctttcgt gagccacctc tctgaagctc tccgtcgaaa 360 aggcataaat aatgtggtcg tagatgtaga tatcgatgat ctgcttttca aggagtctca 420 ggcaaagatc gagaaagctg gggtttctgt gatggtttta cccggaaact gtgacccttc 480 cgaggtatgg cttgacaagt tcgccaaggt tctcgagtgc caaaggaaca acaaggacca 540 ggcggtggtt tcagtgttgt acggtgacag tctattacgg gaccaatggc ttagcgagct 600 ggatttcaga ggcttatcac gaattcacca atccaggttt tcgctttatt ctactctctg 660 cttttttttt ccttatgttg aaaattggta aaactttttg gatggctagt gtagtgtttt 720 caatatttga tgaaatatgg tgcaggaagg aatgtagtga ctctatactt gtagaagaga 780 ttgtgagaga tgtgtacgag acgcactttt atgttggacg aattggaatc tattcgaagc 840 tgctggagat tgaaaacatg gttaacaagc aaccgatagg catccgttgt gttggaattt 900 ggggtatgcc tggcatagga aagacaacac ttgctaaagc agtctttgac caaatgtcta 960 gcgcctttga tgcttcttgt tttatcgaag actatgacaa atcaattcat gagaagggtc 1020 tttattgttt gctggaggaa caacttttgc cgggtaatga tgcaaccatt atgaaactga 1080 gctcgctcag agacagattg aacagtaaga gagttcttgt tgttctcgat gacgtgtgca 1140 atgctctggt tgcagagtct tttctcgagg ggtttgactg gctaggaccc ggaagcctga 1200 tcatcataac ctctagagat aaacaagtgt ttcgcctttg cggaatcaat caaatatatg 1260 aggtccaggg tttaaatgag aaagaggctc gtcaactttt cttgctgtct gcgtctataa 1320 tggaggatat gggagagcag aatctccatg agttgtcagt gagagtaata agttacgcta 1380 atggaaaccc gttagctatc agtgtttatg gaagagagct gaaaggtaag aaaaaactct 1440 cagaaatgga gactgcattc ctcaaactca agcgacgtcc tccatttaag attgtcgatg 1500 catttaaaag cagctacgac acactcagtg acaacgaaaa gaacattttt ttggacatag 1560 cttgtttttt ccagggagaa aatgtcaact acgtgataca actgcttgag ggttgtggtt 1620 tctttccaca tgttgaaatt gatgtccttg ttgacaagtg tctggtgact atttcagaaa 1680 accgagtttg gttgcataag ctgacccagg atatcggccg agaaatcata aatggagaaa 1740 cagtacagat cgagaggcgc agaagactgt gggaaccttg gagcatcaaa tatttattag 1800 aatataatga acacaaagca aatggagaac ctaaaacaac cttcaaacgt gctcaggttt 1860 gttatatttt gatagtttca tatttattct actttatgag ccaacggcct aatatatcca 1920 cttccaatat tttcagggct ctgaagagat cgaaggcctg tttctagaca catcaaactt 1980 aagatttgat ctgcagccct ctgcctttaa gaatatgttg aaccttagat tgctcaaaat 2040 ttattgttcc aatcctgaag tccatcctgt aatcaatttc ccaacaggct ctctgcattc 2100 tcttcctaat gagctaagac tcctccattg ggagaactat cctctgaaat ctttgcctca 2160 gaattttgat cctaggcacc ttgtcgaaat caacatgccg tatagtcaac ttcagaaact 2220 ttggggtgga accaaggtaa gcaaactctg cgtctttcac atctatcatt acttgaaatt 2280 ttttgttgtt attcatttag cttgcttttg tttttgtctt ctcagaacct ggagatgttg 2340 aggacgatca ggctttgcca ttcccagcat ctagttgata tcgatgatct cttaaaagct 2400 gaaaatcttg aggtaattga tctccaaggt tgtacgagac tgcagaattt cccagccgca 2460 ggtcgattgc tacgtctacg agttgtaaat ctctcaggtt gcataaagat taaaagtgtc 2520 ctagaaattc caccaaatat tgagaaacta catctacagg gaactggcat attagcatta 2580 ccagtttcca ctgttaagcc aaaccataga gagcttgtga attttctaac agaaattccg 2640 ggtctttcag aggcctcgaa acttgagcgt ttaacaagtc tgctggaatc taactcatct 2700 tgtcaagatc ttgggaagct tatttgcttg gagctgaaag attgctcttg tttgcagagt 2760 ctgccaaaca tggctaattt agatcttaat gttcttgatc tctcgggttg ctcaagtctt 2820 aattctattc agggtttccc tcgttttctg aaacagttat atcttggtgg cactgcaata 2880 agagaagtgc cacaacttcc tcaaagtcta gaaatcttga atgcacatgg atcttgtttg 2940 cgaagtctgc caaacatggc taatttagaa tttctcaaag ttcttgatct ctctggttgc 3000 tcagagctcg agactattca gggttttcct cggaacctaa aagagttata ttttgctggc 3060 actacgttaa gagaagtgcc ccaacttcct ttaagcctag aggtcttgaa tgcacatggt 3120 tctgactcgg agaagcttcc tatgcattac aagttcaaca attttttcga tctatctcaa 3180 caagtggtca acgatttttt cttgaaagcg ctgacttatg taaaacacat accaagaggg 3240 tatacgcagg taatactctc tctccttctg cctctctctt gtctacttac ctctccatta 3300 gctctctttc tttccatcac cccctcaatc ctttctcttt ctctgtccgc ctatgtctcc 3360 cgctctctat agtgcctgtt atgtacaccc aaccacaccc tcctcatatc atatctgact 3420 tatttgtatt gcaacaggaa ctcatcaaca aagctccgac tttcagcttc agtgcgccct 3480 cacatacaaa tcaaaacgcc acatttgatc tgcaaccagg atcttctgta atgacacgac 3540 taaatcattc atggaggaac acgcttgtgg gatttggtat gctggtggaa gttgcatttc 3600 ccgaggacta ctgtgatgct acagatgttg gcataagttg tgtttgcaga tggagcaaca 3660 aagaaggccg ctcttgtagg atagaaagaa attttcattg ttgggcacca gggaaagttg 3720 ttccaaaagt tcgaaaggat catacgtttg tctttagtga tgtcaacatg cgcccaagta 3780 ccggtgaagg aaatgaccct gatatctggg ctggattagt tgtatttgag ttctttccta 3840 tcaatcagca gacaaagtgt ctaaatgata ggttcacagt gacaagatgt ggagtccgtg 3900 taataaatgt tgcaactggc aatacaagtc ttgagaacat atcactagtt ttgtctttgg 3960 atccagtgga ggtttctggt tatgaagtat tgagagtcag ctatgatgat ttacaggaga 4020 tggataaagt tctatttctt tacatagcgt ctttgttcaa tgacgaggat gttgattttg 4080 tggcaccact tattgccggt attgacttgg atgttagctc tgggctcaag gtcttagccg 4140 atgtgtctct cataagtgta tcatccaatg gggaaatagt gatgcatagt ttgcaaagac 4200 aaatgggcaa agaaatcctc catggacaat ccatgctgct gtctgattgt gagagttcca 4260 tgaccgagaa tttgtctgac gtaccaaaaa agtaagttct ctttttctat cagcttcata 4320 tacaaccgta gacttaaata aatttgaaaa tatttgagta catataatca cattaggtgc 4380 aatgaaaaaa cgtgtaattt attagattga actaaagttt acaaaaagcc attagagatg 4440 tatcagtatg ctaccttttc caagttgctt aataacatag gtttttcatg tttgtacttt 4500 ttagattaat ttaaaaatcc gtttttttct ccattcaaga ttttagaaaa gtttattttt 4560 tatttaatga aatattagat ttgttgtgaa aaagacatta atttacaatc agaggtaaaa 4620 tctctagaga catttggcat tttcgctcgg ctttctttgt taaatccttt tttcgtgagg 4680 attttagaaa agttgattgt gagctactca ccaaggtttc atctttttga aaaatactct 4740 tacataagat gtgtcaagga aatgttagtt tatctaacaa aaacccactt ctcaacgttt 4800 ccatatctat gtatcttttt actttcatga aaccatgaga gacatatatg cattatgaaa 4860 ttttctgaat tcttacattc cttatagaac ttctctccca cccacaaaga tttgggaatc 4920 attagtcttt tttttttttt ttcaattttc aattgatgtt tgcttttgtt ttacttctta 4980 ttatcatgtt aaaaactttt ttgatgccat aaaaataacc tttatacgta tgttgtaatc 5040 atttttatgg taacatacat tagattttta aactttgcca caaaaaaaaa aaaaaattag 5100 atttttaaaa ttctaaaagt agagttaaac tccgtgtcga tagcaaaaga gggttacatt 5160 acgtgttact ttttactttg tgtgattaat ctgcatttta gaggtagagg gtaaaattgc 5220 aaaaatttgg cgaatggaat ttttttttct tccacaaaac cttatcttaa accgaactga 5280 tcatggcaca cactaacatt gatcacagaa gtagctataa atcacaacat ataatcacat 5340 gggtttcgat atatgttata tcgacgttgg atgcagggag aagaaacatc gcgaaagtaa 5400 ggtaaagaaa gtggtttcca taccggctat agacgaggga gatctatgga cttggcgaaa 5460 gtacggtcaa aaagacatct taggttctcg ttttccaagg tacactcatg tatttttgta 5520 tatacatatt cctatatttg tgtatatatt atattcttta caaacataat aaatagttat 5580 acatataaac aaattattta aatccaacaa aaacaaaata gtattttaca ataggaaagt 5640 tttgtaacaa cttgcaaaac accatttttt atacacaaac gtaatcatcc taaacttgct 5700 ggtggcctgg taaaccttca tatgttgctg gaaaagtgaa tatttctaag aaaaaacaaa 5760 aattattaac cattgaaaaa gtcacattaa gcagaatctg cagactgcag ctttgttttt 5820 ccccttcact aacagtattg cagataaaat cattagcgtg aaactgtaaa atagaaagtt 5880 catttaatta tctacggcgc aaattataag cctttaaatt cttttttatg atgcagactt 5940 atcttataat cacaatattc catttgaagg atattataga tgtgtttttt tttttttaat 6000 ccagttgtgc gtaagtgatc aaattatggt tagttatgac ttttgatggt cacttgatga 6060 cttaattaat tatttgttcc tcaggggtta ctacaggtgc gcttacaagt tcacgcatgg 6120 ttgtaaagct acaaaacaag tccaacggag cgagaccgat tcaaacatgt tagctattac 6180 ttacctatct gagcataacc atccacggcc cactaaacgc aaggctctcg ctgactccac 6240 tcgttccact tcctcctcca tctgctcagc cataactacc tctgcctcat ctagagtctt 6300 ccaaaacaaa gacgaaccaa atcaacccca cttgccttcc tcctccactc ctcctagaaa 6360 cgcggctgtc ttgtttaaaa tgacggacat ggaggagttt caggacaata tggaggtgga 6420 taatgacgtc gtagatacac gtacactggc attgtttcca gagtttcaac atcagccgga 6480 ggaagaagac ccatggtcaa cattcttcga tgattataat ttttactttt gattgagctc 6540 actctcatca ctgcaagaca agaaaaaata accagtgaga tccaaggatt tatgtacacc 6600 caaggattta gtttttgcag tgtcaatgat tttcagccaa taataaatca ttaatttcaa 6660 ttaccagtcc tccccaacct gtaaacccat catgcctcca acatacggtg atctcgacaa 6720 attcgaaagc ctaataccaa ggatacatgt atcacagtag gatttctcaa tcctgatgca 6780 ctcgaagagg aatagatctt aagaaatcat taagccttat gaaagcaagg aaagaaatgg 6840 ctgagatgtc ttacgtacgt tgattacaaa tagaggcaat caacacaaca attgttttgt 6900 tgtgggtttt gtgtcgatat accattgcgg ttaagagcct gccaaatggg tatgtaagct 6960 agagatctct ctcatgtgtt tattctcttt gatttgtgta aagctttgaa ctttcctatt 7020 ttcaataaaa ccatagtctg aagaggtgaa tttcgttatc cctattttca actaaagatg 7080 tagtatatga aaattaccta aaatcttttt caaaatcctt tcttaaaagg tttgataatt 7140 tttgaataac aatggatttg atttgagttt cacaaatcat tgattgaata gaagaaaagg 7200 atggtaaatg attttgaatt attttacata aatcaataat taataacagg tgattaataa 7260 aaaagactcg aaatcttcaa agagcatctc cataagagta tcttaacaat aaaatgacat 7320 tatttctata gaatttagtt gtttaattaa atttaaattt gttacaaaat taacccaaca 7380 atcaagtgac atgggtaaaa tagagtttct aaatcataag tttctcaaaa gttacatgtg 7440 taatataccg gctatgcttt ggtttctgtt gtctgtgaag atgatccttc tggaaggtct 7500 cgaaggcttc ctctggactc cccttcgatt catgtttctg tattgacgcg atgtctagat 7560 cttccgtttg tgacatagtg aatgaattaa gaaggggttt gtttctttca gtagtttaag 7620 gatgcgtttt gtaacctcgg aattagtttg gactcggcca cttgtgtatt tctaagttct 7680 tagtatttga tctttggctt gctttgtggt tacctttcct tcaagcttta ggattttagt 7740 agttgagact cttgcagaag tttaggattg attggattgc actcatggct ccactgggtc 7800 gcgcgttaag tgtaaaggtt cctaggacgg gggttaagtg agagatttga tttgtcctct 7860 cttagtattt tattttctgc ttgaactttc tgatatatct cctctatgtc ttagatcctt 7920 cttagtttga tccatt 7936 2 259 DNA Arabidopsis thaliana 2 tccatggccc aaaagctttt tcccggtgta cttggacttt tcctcactgg tgtgcgttag 60 agtagcaaga atgtattgtt gactattctt ttctcttttt ttaaatacct ttttgttagt 120 ttttgctgtt ggaaaaatat caaatgtaaa aaaaacactg agatttgagt ggttgaatgt 180 ccaatgaaaa agcatcgtcg gcataaacaa aaattccggc gaaatctaag gttagttcaa 240 agtatcaaag cgcgaaatc 259 3 1404 DNA Arabidopsis thaliana 3 ttgagctcac tctcatcact gcaagacaag aaaaaataac cagtgagatc caaggattta 60 tgtacaccca aggatttagt ttttgcagtg tcaatgattt tcagccaata ataaatcatt 120 aatttcaatt accagtcctc cccaacctgt aaacccatca tgcctccaac atacggtgat 180 ctcgacaaat tcgaaagcct aataccaagg atacatgtat cacagtagga tttctcaatc 240 ctgatgcact cgaagaggaa tagatcttaa gaaatcatta agccttatga aagcaaggaa 300 agaaatggct gagatgtctt acgtacgttg attacaaata gaggcaatca acacaacaat 360 tgttttgttg tgggttttgt gtcgatatac cattgcggtt aagagcctgc caaatgggta 420 tgtaagctag agatctctct catgtgttta ttctctttga tttgtgtaaa gctttgaact 480 ttcctatttt caataaaacc atagtctgaa gaggtgaatt tcgttatccc tattttcaac 540 taaagatgta gtatatgaaa attacctaaa atctttttca aaatcctttc ttaaaaggtt 600 tgataatttt tgaataacaa tggatttgat ttgagtttca caaatcattg attgaataga 660 agaaaaggat ggtaaatgat tttgaattat tttacataaa tcaataatta ataacaggtg 720 attaataaaa aagactcgaa atcttcaaag agcatctcca taagagtatc ttaacaataa 780 aatgacatta tttctataga atttagttgt ttaattaaat ttaaatttgt tacaaaatta 840 acccaacaat caagtgacat gggtaaaata gagtttctaa atcataagtt tctcaaaagt 900 tacatgtgta atataccggc tatgctttgg tttctgttgt ctgtgaagat gatccttctg 960 gaaggtctcg aaggcttcct ctggactccc cttcgattca tgtttctgta ttgacgcgat 1020 gtctagatct tccgtttgtg acatagtgaa tgaattaaga aggggtttgt ttctttcagt 1080 agtttaagga tgcgttttgt aacctcggaa ttagtttgga ctcggccact tgtgtatttc 1140 taagttctta gtatttgatc tttggcttgc tttgtggtta cctttccttc aagctttagg 1200 attttagtag ttgagactct tgcagaagtt taggattgat tggattgcac tcatggctcc 1260 actgggtcgc gcgttaagtg taaaggttcc taggacgggg gttaagtgag agatttgatt 1320 tgtcctctct tagtatttta ttttctgctt gaactttctg atatatctcc tctatgtctt 1380 agatccttct tagtttgatc catt 1404 4 4343 DNA Arabidopsis thaliana 4 gtccaatgaa aaagcatcgt cggcataaac aaaaattccg gcgaaatcta aggttagttc 60 aaagtatcaa agcgcgaaat catgaccaat tgtgaaaagg atgaggaatt cgtgtgcatc 120 agctgcgtag aagaggtacg gtactctttc gtgagccacc tctctgaagc tctccgtcga 180 aaaggcataa ataatgtggt cgtagatgta gatatcgatg atctgctttt caaggagtct 240 caggcaaaga tcgagaaagc tggggtttct gtgatggttt tacccggaaa ctgtgaccct 300 tccgaggtat ggcttgacaa gttcgccaag gttctcgagt gccaaaggaa caacaaggac 360 caggcggtgg tttcagtgtt gtacggtgac agtctattac gggaccaatg gcttagcgag 420 ctggatttca gaggcttatc acgaattcac caatccagga aggaatgtag tgactctata 480 cttgtagaag agattgtgag agatgtgtac gagacgcact tttatgttgg acgaattgga 540 atctattcga agctgctgga gattgaaaac atggttaaca agcaaccgat aggcatccgt 600 tgtgttggaa tttggggtat gcctggcata ggaaagacaa cacttgctaa agcagtcttt 660 gaccaaatgt ctagcgcctt tgatgcttct tgttttatcg aagactatga caaatcaatt 720 catgagaagg gtctttattg tttgctggag gaacaacttt tgccgggtaa tgatgcaacc 780 attatgaaac tgagctcgct cagagacaga ttgaacagta agagagttct tgttgttctc 840 gatgacgtgt gcaatgctct ggttgcagag tcttttctcg aggggtttga ctggctagga 900 cccggaagcc tgatcatcat aacctctaga gataaacaag tgtttcgcct ttgcggaatc 960 aatcaaatat atgaggtcca gggtttaaat gagaaagagg ctcgtcaact tttcttgctg 1020 tctgcgtcta taatggagga tatgggagag cagaatctcc atgagttgtc agtgagagta 1080 ataagttacg ctaatggaaa cccgttagct atcagtgttt atggaagaga gctgaaaggt 1140 aagaaaaaac tctcagaaat ggagactgca ttcctcaaac tcaagcgacg tcctccattt 1200 aagattgtcg atgcatttaa aagcagctac gacacactca gtgacaacga aaagaacatt 1260 tttttggaca tagcttgttt tttccaggga gaaaatgtca actacgtgat acaactgctt 1320 gagggttgtg gtttctttcc acatgttgaa attgatgtcc ttgttgacaa gtgtctggtg 1380 actatttcag aaaaccgagt ttggttgcat aagctgaccc aggatatcgg ccgagaaatc 1440 ataaatggag aaacagtaca gatcgagagg cgcagaagac tgtgggaacc ttggagcatc 1500 aaatatttat tagaatataa tgaacacaaa gcaaatggag aacctaaaac aaccttcaaa 1560 cgtgctcagg gctctgaaga gatcgaaggc ctgtttctag acacatcaaa cttaagattt 1620 gatctgcagc cctctgcctt taagaatatg ttgaacctta gattgctcaa aatttattgt 1680 tccaatcctg aagtccatcc tgtaatcaat ttcccaacag gctctctgca ttctcttcct 1740 aatgagctaa gactcctcca ttgggagaac tatcctctga aatctttgcc tcagaatttt 1800 gatcctaggc accttgtcga aatcaacatg ccgtatagtc aacttcagaa actttggggt 1860 ggaaccaaga acctggagat gttgaggacg atcaggcttt gccattccca gcatctagtt 1920 gatatcgatg atctcttaaa agctgaaaat cttgaggtaa ttgatctcca aggttgtacg 1980 agactgcaga atttcccagc cgcaggtcga ttgctacgtc tacgagttgt aaatctctca 2040 ggttgcataa agattaaaag tgtcctagaa attccaccaa atattgagaa actacatcta 2100 cagggaactg gcatattagc attaccagtt tccactgtta agccaaacca tagagagctt 2160 gtgaattttc taacagaaat tccgggtctt tcagaggcct cgaaacttga gcgtttaaca 2220 agtctgctgg aatctaactc atcttgtcaa gatcttggga agcttatttg cttggagctg 2280 aaagattgct cttgtttgca gagtctgcca aacatggcta atttagatct taatgttctt 2340 gatctctcgg gttgctcaag tcttaattct attcagggtt tccctcgttt tctgaaacag 2400 ttatatcttg gtggcactgc aataagagaa gtgccacaac ttcctcaaag tctagaaatc 2460 ttgaatgcac atggatcttg tttgcgaagt ctgccaaaca tggctaattt agaatttctc 2520 aaagttcttg atctctctgg ttgctcagag ctcgagacta ttcagggttt tcctcggaac 2580 ctaaaagagt tatattttgc tggcactacg ttaagagaag tgccccaact tcctttaagc 2640 ctagaggtct tgaatgcaca tggttctgac tcggagaagc ttcctatgca ttacaagttc 2700 aacaattttt tcgatctatc tcaacaagtg gtcaacgatt ttttcttgaa agcgctgact 2760 tatgtaaaac acataccaag agggtatacg caggaactca tcaacaaagc tccgactttc 2820 agcttcagtg cgccctcaca tacaaatcaa aacgccacat ttgatctgca accaggatct 2880 tctgtaatga cacgactaaa tcattcatgg aggaacacgc ttgtgggatt tggtatgctg 2940 gtggaagttg catttcccga ggactactgt gatgctacag atgttggcat aagttgtgtt 3000 tgcagatgga gcaacaaaga aggccgctct tgtaggatag aaagaaattt tcattgttgg 3060 gcaccaggga aagttgttcc aaaagttcga aaggatcata cgtttgtctt tagtgatgtc 3120 aacatgcgcc caagtaccgg tgaaggaaat gaccctgata tctgggctgg attagttgta 3180 tttgagttct ttcctatcaa tcagcagaca aagtgtctaa atgataggtt cacagtgaca 3240 agatgtggag tccgtgtaat aaatgttgca actggcaata caagtcttga gaacatatca 3300 ctagttttgt ctttggatcc agtggaggtt tctggttatg aagtattgag agtcagctat 3360 gatgatttac aggagatgga taaagttcta tttctttaca tagcgtcttt gttcaatgac 3420 gaggatgttg attttgtggc accacttatt gccggtattg acttggatgt tagctctggg 3480 ctcaaggtct tagccgatgt gtctctcata agtgtatcat ccaatgggga aatagtgatg 3540 catagtttgc aaagacaaat gggcaaagaa atcctccatg gacaatccat gctgctgtct 3600 gattgtgaga gttccatgac cgagaatttg tctgacgtac caaaaaagga gaagaaacat 3660 cgcgaaagta aggtaaagaa agtggtttcc ataccggcta tagacgaggg agatctatgg 3720 acttggcgaa agtacggtca aaaagacatc ttaggttctc gttttccaag gggttactac 3780 aggtgcgctt acaagttcac gcatggttgt aaagctacaa aacaagtcca acggagcgag 3840 accgattcaa acatgttagc tattacttac ctatctgagc ataaccatcc acggcccact 3900 aaacgcaagg ctctcgctga ctccactcgt tccacttcct cctccatctg ctcagccata 3960 actacctctg cctcatctag agtcttccaa aacaaagacg aaccaaatca accccacttg 4020 ccttcctcct ccactcctcc tagaaacgcg gctgtcttgt ttaaaatgac ggacatggag 4080 gagtttcagg acaatatgga ggtggataat gacgtcgtag atacacgtac actggcattg 4140 tttccagagt ttcaacatca gccggaggaa gaagacccat ggtcaacatt cttcgatgat 4200 tataattttt acttttgatt gagctcactc tcatcactgc aagacaagaa aaaataacca 4260 gtgagatcca aggatttatg tacacccaag gatttagttt ttgcagtgtc aatgattttc 4320 agccaataat aaatcattaa ttt 4343 5 9399 DNA Arabidopsis thaliana 5 gcatgcctcg aaaacttgcc ctgccacgtc ttatatagct ctttcaggag tgtggtttta 60 ccaattccgg gcatcccaac aactccaatg atacgagttc ccttgtattt atcacgatcc 120 aacttctctt ccaaatcttt taaccgttgt tcgtttccaa aagtctcatg ctttttgtct 180 cctgaggaag ttccagcgtt gctattacct aaagcaccca cgacggcgtt gtgacttccc 240 tccggtggta ttccggtcaa cgctgtcttc accgccttca caatttcatt gactttctca 300 ctctccacac tgcaatttca aggctaatga gatcatgacc tccctaagtt ccataaaagg 360 ttctagaagt attcatagat ctgtctacaa ttgtaatcta atcactactg tgtctaggtt 420 tgtaactgag tcaaaaaaaa taaaaaaaaa tggtaccttt tcttgtcaat gatgatgccc 480 ataatgttag gaatcaagtt aaaagcttcc ttccatttct ttttcctctc atcaccctta 540 gccatactcc taaatctatc accgaacttt cctttcaaat ctctaacggt ggatggctcc 600 agcttgtaga agattggaat cgcaacgagt gttccttcat ccgtacaatc tttgatcttc 660 tccagctctc tcacgcacca gactgactcg gtgtagttgc cggagaagat agccaaaacg 720 attttggact cctctatcct cttcagcagt acatctagag gttgacctct gtcttcatag 780 tcgtcgataa agacgttgat gttgttcaat ttcaaggccg ttacgagatg gctgacgaat 840 ctccggcgca aatctgcccc acggaaattg atgaacacct gatgctgcgg tggcttgtct 900 tccacagtgg aaatagatga tgtctccatg gcccaaaagc tttttcccgg tgtacttgga 960 cttttcctca ctggtgtgcg ttagagtagc aagaatgtat tgttgactat tcttttctct 1020 ttttttaaat acctttttgt tattttttgc tgttggaaaa atatcaaatg taaaaaaaca 1080 actgagattt gagtggttga atgtccaatg aaaaagcatc gtcggcataa acaaaaattc 1140 cggcgaaatc aaaggttagt tcaaagtatc aaagcgcgaa atcatgacca attgtgaaaa 1200 ggatgaggaa ttcgtgtgca tcagctgcgt agaagaggta cggtactctt tcgtgagcca 1260 cctctctgaa gctctccgtc gaaaaggcat aaataatgtg gtcgtagatg tagatatcga 1320 tgatctgctt ttcaaggagt ctcaggcaaa gatcgagaaa gctggggttt ctgtgatggt 1380 tttacccgga aactgtgatc cttccgaggt atggcttgac aagttcgcca aggttctcga 1440 gtgccaaagg aacaacaagg accaggcggt ggtttcagtg ttgtacggtg acagtctatt 1500 acgggaccaa tggcttagcg agctggattt cagaggctta tcacgaattc accaatccag 1560 gttttcgctt tattctactc tctgcttttt ttttccttat gttgaaaatt ggtaaaactt 1620 tttggatggc tagtgtagtg ttttcaatat ttgatgaaat atggtgcagg aaggaatgta 1680 gtgactctat acttgtagaa gagattgtga gagatgtgta cgagacgcac ttttatgttg 1740 gacgaattgg aatctattcg aagctgctgg agattgaaaa catggttaac aagcaaccga 1800 taggcatccg ttgtgttgga atttggggta tgcctggcat aggaaagaca acacttgcta 1860 aagcagtctt tgaccaaatg tctagcgcct ttgatgcttc ttgttttatc gaagactatg 1920 acaaatcaat tcatgagaag ggtctttatt gtttgctgga ggaacaactt ttgccgggca 1980 atgatgcaac cattatgaaa ctgagctcgc tcagagacag attgaacagt aagagagttc 2040 ttgttgttct cgatgacgtg cgcaatgctc tggttgggga gtcctttctc gaggggtttg 2100 actggctagg acccggaagc ctgatcatca taacctctag agataaacaa gtgttttgcc 2160 tttgcggaat caatcaaata tatgaggtcc agggtttaaa tgagaaagag gctcgtcaac 2220 ttttcttgct gtctgcgtct ataaaggagg atatgggaga gcagaatctc caggagttgt 2280 cagtgagagt aataaattat gctaatggaa acccgttagc tatcagtgtt tatggaagag 2340 agctgaaagg taagaaaaaa ctctcagaaa tggagactgc attcctcaaa ctcaagcgac 2400 gtcctccatt taagattgtc gatgcattta aaagcaccta tgacacactc agtgacaacg 2460 aaaagaacat ttttttggac atagcttgtt tcttccaggg agaaaatgtc aactacgtga 2520 tacaactgct tgagggttgt ggtttctttc cacatgttga aattgatgtc cttgttgaca 2580 agtgtctggt aactatttca gaaaaccgag tttggttgca taagctgacc caggatatcg 2640 gccgagaaat cataaatgga gaaacagtac agatcgagag gcgcagaaga ctgtgggaac 2700 cttggagcat caaatattta ttagaatata atgaacacaa agcaaatgga gaacctaaaa 2760 caaccttcaa acgtgctcag gtttgttata ttttgatagt ttcatatttt ttctacttta 2820 tgagccaacg gcctaatata tccacttcca atattttcag ggctctgaag agatcgaagg 2880 cctgtttcta gacacatcaa acttaagatt tgatctgcag ccctctgcct ttaagaatat 2940 gttgaacctt agattgctca aaatttattg ttccaatcct gaagtccatc ctgtaatcaa 3000 tttcccaaca ggctctctgc attctcttcc taatgagcta agactcctcc attgggagaa 3060 ctatcctctg aaatctttgc ctcagaattt tgatccaagg caccttgtcg aaatcaacat 3120 gccgtatagt caacttcaga aactttgggg tggaaccaag gtaagcaatc tctgatatgc 3180 gtcgttcacc cttatcatta cttgacaaat tttgttgtta ttcatttagc ttgctttttt 3240 tttgtcttct cagaacctgg agatgttgag gacgatcagg ctttgccatt cccaccatct 3300 agttgatatc gatgatctct taaaagctga aaatcttgag gtaattgatc tccaaggttg 3360 tacgagactg cagaatttcc cagccgcagg tcgattgcta cgtctacgag ttgtaaatct 3420 ctcaggttgc ataaagatta aaagtgtcct agaaattcca ccaaatattg agaaactaca 3480 tctacaggga actggcatat tagcattacc agtttccact gttaagccaa accatagaga 3540 gcttgtgaat tttctaacag aaattccggg tctttcagag gaacttgagc gtttaacaag 3600 tctgctggaa tctaactcat cttgtcaaga tcttgggaag cttatttgct tggagctgaa 3660 agattgctct tgtttgcaga gtctgccaaa catggctaat ttagatctta atgttcttga 3720 tctctcgggt tgctcaagtc ttaattctat tcagggtttc cctcgttttc tgaaacagtt 3780 atatcttggt ggcactgcaa taagagaagt gccacaactt cctcaaagtc tagaaatctt 3840 gaatgcacat ggatcttgtt tgcgaagtct gccaaacatg gctaatttag aatttctcaa 3900 agttcttgat ctctctggtt gctcagagct cgagactatt cagggttttc ctcggaacct 3960 aaaagagtta tattttgctg gcactacgtt aagagaagtg ccccaacttc ctttaagcct 4020 agaggtcttg aatgcacatg gttctgactc ggagaagctt cctatgcatt acaagttcaa 4080 caattttttc gatctatctc aacaagtggt caacgatttt ttattgaaaa cgctgactta 4140 tgtaaaacac ataccaagag ggtatacgca ggtaatactc tctctccttc tgcctctctc 4200 ttgtctactt acctctccat tagctctctt tctttccatc accccctcaa tcctttctct 4260 ttctctgttc gcctatgtct cccgctctct atagtgcctg ttatgtacac ccaaccacac 4320 cctcctcata tcatatctga cttatttgta ttgcaacagg aactcatcaa caaagctccg 4380 actttcagct tcagtgcgcc ctcacataca aatcaaaacg ccacatttga tctgcaatca 4440 ggatcttctg taatgacacg actaaatcat tcatggagga acacgcttgt gggatttggt 4500 atgctggtgg aagttgcatt tcccgaggac tactgtgatg ctacagatgt tggcataagt 4560 tgtgtttgca gatggagcaa caaagaaggc cgctcttgta ggatagaaag aaaatttcat 4620 tgttgggcac catggcaagt tgttccaaaa gttcgaaagg atcatacgtt tgtctttagt 4680 gatgtcaaca tgcgcccaag taccggtgaa ggaaatgacc ctgatatctg ggctggatta 4740 gttgtatttg agttctttcc tatcaatcag cagacaaagt gtctaaatga taggttcaca 4800 gtgagaagat gtggagtccg tgtaataaat gttgcaactg gcaatacaag tcttgagaac 4860 atagcactag ttttgtcttt ggatccagta gaggtttccg gttatgaagt attgagagtc 4920 agctatgatg atttacagga gatggataaa gttctatttc tttacatagc gtctttgttc 4980 aatgacgagg atgttgattt tgtggcacca cttattgccg gtattgactt ggatgttagc 5040 tctgggctca aggtcttagc cgatgtgtct ctcataagtg tatcatcaaa tggggaaata 5100 gtgatgcata gtttgcaaag acaaatgggt aaagaaatcc tccatggaca atccatgctg 5160 ctgtctgatt gtgagagttc catgaccgag aatttgtctg acgtaccaaa aaagtaagtt 5220 ctctttttct atcagcttca tatacaaccg tagacttaaa taaatttgaa aatatttgag 5280 tacataaaat cacattaggt gcaatgaaaa aacgtgcaat ttattagatt gaactaaagt 5340 ttacaaaaaa gccattagag atgtaccagt atgctacctt ttccaagttg cttaataaca 5400 taggtttttc atgtttgtac tttttagatt aatttaaaaa tccgtttttt actccattca 5460 agattttaga aaagtttatt ttttatttaa tgaaatatta gatttgttgt gaaaaagaca 5520 ttaatttaca atcagaggta aaatctctag agacatttgg catttttgct cggctttctt 5580 tgttaaatcc ttttttcgtg aggattttag aaaagttgat tgtgagctac tcaccaaggt 5640 ttcatctttt tgaaaaatac tcttacataa gatgtgtcaa ggaaatgtta gtttatctaa 5700 caaaaaccca cttctcaacg tttccatatc tatgtatctt tttactttca tgaaaccatg 5760 agagacatat atgcattatg aaattttctg aattcttaca ttccttatag aacttctctc 5820 ccacccacaa agatttggga atcattagtc ttttttattt tttttcaatt ttcaattgat 5880 gtttgctttt gttttacttc ttattatcat gttaaaaact tttttgatgc cataaaaata 5940 acctttatac gtatgttgta atcgttttta tggtaacata tattagattt ttaaactttg 6000 caaaaaaaaa aaaaaaaaaa acattagatt tttaaaattc taaaagtagg gttaaactcc 6060 gtgtcgatag caaaaaaggg ttacattacg tgttactttt tactttgtgt gattaatttg 6120 cattttagag gtagagggta aaattgcaaa aatttggcga atgaaaattt ttttttgcac 6180 aaaaccttat cttaaaccga actgatcatg gcacacacta acattgatca cagaactagc 6240 tataaatcac aatatataat cacatgggtt tcgatatatg ttatatcgac gttggatgca 6300 ggaagaagaa acatagcgaa agtagggtaa agaaagtggt ttccataccg gctatagacg 6360 agggagatct atggacttgg cgaaagtacg gtcaaaaaga catcttaggt tctcgttttc 6420 caaggtacac tcatgtattt ttgtatatac atattcctat atttgtgtat atattatatt 6480 ctttacaaac ataataaata gttatacata taaacaaatt atttaaatcc aacaaaaaca 6540 aaatagtatt ttacaaaagg aaagttttgt aacaacttgc aaaacaccat tttttataca 6600 caaacgtaat catcctaaac ttgctggtgg cctggtaaac cttcatatgt tgctggaaaa 6660 gtgaatattt ctaagaaaaa acaaaaatta ttaaccattg aaaaagtcac attaagcaga 6720 atactgcagc tttgtaaaat agaaagttca tttaattatc tacggcgcaa attataagcc 6780 tttaaattct tttttatgat gcagacttat cttataatca caatattcca tttgaaggat 6840 attatagatg tttttttttt tttttaatcc agttgtgcgt aagtgatcaa attatggtta 6900 gttatgactt ttgatggtca cttgatgact taattaatta tttgttcctc aggggttact 6960 acaggtgcgc ttacaagttc acgcatggtt gtaaagctac aaaacaagtc caacggagcg 7020 agaccgattc aaacatgtta gctattactt acctatctga gcataaccat ccacggccca 7080 ctaaacgcaa ggctctcgct gactccactc gttccacttc ctcctccatc tgctgagcca 7140 taactacctc tgcctcatct agagtcttcc aaaacaaaga cgaaccaaat caaccccact 7200 tgccttcctc ctccactcct cctggaaacg cggctgtctt gtttaaaatg acggacatgg 7260 aggagtttca ggacaatatg gaggtggata atgacgtcgt agatacacgt acactggcat 7320 tgtttccaga gtttcaacat cagccggagg aagaataccc atggtcaaca ttcttcgatg 7380 attataattt ttgtttttat tgagctcact ctcatcactg caagacaaga aaaaataacc 7440 agtgagatcc aaggatttat gtacacccaa ggatttagtt tttgcagtgt caatgatttt 7500 cagccaataa taaatcatta atttcaatta ccagtcctcc ccaacctgta aacccatcat 7560 gcctccaaca tatggtgatc tcgacaaatt cgaaagccta ataccaagga tacatgtatc 7620 acagtaggat ttctcaatcc tgatgcactc gaagaggaat agatcttaag aaatcattaa 7680 gccttatgaa agcaaggaaa gaaatggctg agatgtctta cgttgattac aaatagaggc 7740 aatcaacaca acaattgttt tgttgtgggt tttgtgtcga tataccattg cggttaagag 7800 cctgccaaat gggtatgtaa gctagagatc tctctcatgt gtttattctc tttgatttgt 7860 gtaaagcttt gaactttcct attttcaata aaaccatagt ctgaagaggt gaatttcttt 7920 atccctattt tcaactaaag atgtagtata tgaaaattac ctaaaatctt tttcaaaatc 7980 ctttcttaaa aggtttgata atttttgaat aacaatggat ttgatttgag tttcacaaat 8040 cattgattga atagaagaaa aggatggtaa atgattttga attattttac ataaatcaat 8100 aattaataac aggtgattaa taaaaaagac tctaaatctt caaagagcat ctccataaga 8160 gtatcttaac aataaaatga tattatttct atagaattta gttgtttaat taaatttaaa 8220 tttgttacaa aattaaccca acaatcaagt gacatgggta aaatagagtt tctaaatcat 8280 aagtttctca aaagttacat gtgtaatata ccggctatgc tttggtttct gttgtctgtg 8340 aagatgatcc ttctggaagg tctcgaaggc ttcctctgga ctccccttcg attcatgttt 8400 ctgtattgac gcgatgtcta gatcttccgt ttgtgacata gtgaatgaat taagaagggg 8460 tttgtttctt tcagtagttt aaggatgcgt tttgtaacct cggaattagt ttggactcgg 8520 ccacttgtgt atttctaagt tcttagtatt tgatctttgg cttgctttgt ggttaccttt 8580 ccttcaagct ttaggatttt agtagttgag actcttgcag aagtttagga ttgattggat 8640 tgcactcatg gctccgctgg gtcgcgcgtt aagtgtaaag gttcctagga cgggggttaa 8700 gtgagagatt tgatttgtcc tctcttagta ttttattttc tgcttgaact ttctgatata 8760 tctcctctat gtctttgatc cttcttagtt tgatccattt cttttgacct tgctaagatg 8820 gtttcaaggc ttaatttcat attttggcaa aaaacagatg gaatgcagtg ttatagtggg 8880 ttgaaaccgg agatattgag ttaatctctc ttccaccggt tgttgaaata ttttcaagcc 8940 tatggtaggt tttcatgtgc ccctttctcg taaaatcaac aagctgaagc actattggta 9000 tctcatgtag tgtttgaact actctcttat gtggtttaag attattaaga atttcacaat 9060 gcaaatcaca gatatgtaat cgagtagcta tgtgtttggc taaacaagag tttcgcattc 9120 caatgtatgg ttcctttgat tgatatataa aagttacact ttgtgaaaaa aaaaaaaaaa 9180 aaaaaacagg aatccttcac ttttttctat tacttttgaa attttttatt gtgattttag 9240 ataccctaat gaagatgcta taaagcaaat ccctctttct ttttatttct gctaaacaat 9300 atcaaaaatt ctgtgcacga acacatctct gcacaagttt gggtcgccat tttcaattat 9360 cccatattca ttggaggaaa tactaagaat agagcatgc 9399 6 1183 DNA Arabidopsis thaliana 6 gcatgcctcg aaaacttgcc ctgccacgtc ttatatagct ctttcaggag tgtggtttta 60 ccaattccgg gcatcccaac aactccaatg atacgagttc ccttgtattt atcacgatcc 120 aacttctctt ccaaatcttt taaccgttgt tcgtttccaa aagtctcatg ctttttgtct 180 cctgaggaag ttccagcgtt gctattacct aaagcaccca cgacggcgtt gtgacttccc 240 tccggtggta ttccggtcaa cgctgtcttc accgccttca caatttcatt gactttctca 300 ctctccacac tgcaatttca aggctaatga gatcatgacc tccctaagtt ccataaaagg 360 ttctagaagt attcatagat ctgtctacaa ttgtaatcta atcactactg tgtctaggtt 420 tgtaactgag tcaaaaaaaa taaaaaaaaa tggtaccttt tcttgtcaat gatgatgccc 480 ataatgttag gaatcaagtt aaaagcttcc ttccatttct ttttcctctc atcaccctta 540 gccatactcc taaatctatc accgaacttt cctttcaaat ctctaacggt ggatggctcc 600 agcttgtaga agattggaat cgcaacgagt gttccttcat ccgtacaatc tttgatcttc 660 tccagctctc tcacgcacca gactgactcg gtgtagttgc cggagaagat agccaaaacg 720 attttggact cctctatcct cttcagcagt acatctagag gttgacctct gtcttcatag 780 tcgtcgataa agacgttgat gttgttcaat ttcaaggccg ttacgagatg gctgacgaat 840 ctccggcgca aatctgcccc acggaaattg atgaacacct gatgctgcgg tggcttgtct 900 tccacagtgg aaatagatga tgtctccatg gcccaaaagc tttttcccgg tgtacttgga 960 cttttcctca ctggtgtgcg ttagagtagc aagaatgtat tgttgactat tcttttctct 1020 ttttttaaat acctttttgt tattttttgc tgttggaaaa atatcaaatg taaaaaaaca 1080 actgagattt gagtggttga atgtccaatg aaaaagcatc gtcggcataa acaaaaattc 1140 cggcgaaatc aaaggttagt tcaaagtatc aaagcgcgaa atc 1183 7 2263 DNA Arabidopsis thaliana 7 gccataacta cctctgcctc atctagagtc ttccaaaaca aagacgaacc aaatcaaccc 60 cacttgcctt cctcctccac tcctcctgga aacgcggctg tcttgtttaa aatgacggac 120 atggaggagt ttcaggacaa tatggaggtg gataatgacg tcgtagatac acgtacactg 180 gcattgtttc cagagtttca acatcagccg gaggaagaat acccatggtc aacattcttc 240 gatgattata atttttgttt ttattgagct cactctcatc actgcaagac aagaaaaaat 300 aaccagtgag atccaaggat ttatgtacac ccaaggattt agtttttgca gtgtcaatga 360 ttttcagcca ataataaatc attaatttca attaccagtc ctccccaacc tgtaaaccca 420 tcatgcctcc aacatatggt gatctcgaca aattcgaaag cctaatacca aggatacatg 480 tatcacagta ggatttctca atcctgatgc actcgaagag gaatagatct taagaaatca 540 ttaagcctta tgaaagcaag gaaagaaatg gctgagatgt cttacgttga ttacaaatag 600 aggcaatcaa cacaacaatt gttttgttgt gggttttgtg tcgatatacc attgcggtta 660 agagcctgcc aaatgggtat gtaagctaga gatctctctc atgtgtttat tctctttgat 720 ttgtgtaaag ctttgaactt tcctattttc aataaaacca tagtctgaag aggtgaattt 780 ctttatccct attttcaact aaagatgtag tatatgaaaa ttacctaaaa tctttttcaa 840 aatcctttct taaaaggttt gataattttt gaataacaat ggatttgatt tgagtttcac 900 aaatcattga ttgaatagaa gaaaaggatg gtaaatgatt ttgaattatt ttacataaat 960 caataattaa taacaggtga ttaataaaaa agactctaaa tcttcaaaga gcatctccat 1020 aagagtatct taacaataaa atgatattat ttctatagaa tttagttgtt taattaaatt 1080 taaatttgtt acaaaattaa cccaacaatc aagtgacatg ggtaaaatag agtttctaaa 1140 tcataagttt ctcaaaagtt acatgtgtaa tataccggct atgctttggt ttctgttgtc 1200 tgtgaagatg atccttctgg aaggtctcga aggcttcctc tggactcccc ttcgattcat 1260 gtttctgtat tgacgcgatg tctagatctt ccgtttgtga catagtgaat gaattaagaa 1320 ggggtttgtt tctttcagta gtttaaggat gcgttttgta acctcggaat tagtttggac 1380 tcggccactt gtgtatttct aagttcttag tatttgatct ttggcttgct ttgtggttac 1440 ctttccttca agctttagga ttttagtagt tgagactctt gcagaagttt aggattgatt 1500 ggattgcact catggctccg ctgggtcgcg cgttaagtgt aaaggttcct aggacggggg 1560 ttaagtgaga gatttgattt gtcctctctt agtattttat tttctgcttg aactttctga 1620 tatatctcct ctatgtcttt gatccttctt agtttgatcc atttcttttg accttgctaa 1680 gatggtttca aggcttaatt tcatattttg gcaaaaaaca gatggaatgc agtgttatag 1740 tgggttgaaa ccggagatat tgagttaatc tctcttccac cggttgttga aatattttca 1800 agcctatggt aggttttcat gtgccccttt ctcgtaaaat caacaagctg aagcactatt 1860 ggtatctcat gtagtgtttg aactactctc ttatgtggtt taagattatt aagaatttca 1920 caatgcaaat cacagatatg taatcgagta gctatgtgtt tggctaaaca agagtttcgc 1980 attccaatgt atggttcctt tgattgatat ataaaagtta cactttgtga aaaaaaaaaa 2040 aaaaaaaaaa caggaatcct tcactttttt ctattacttt tgaaattttt tattgtgatt 2100 ttagataccc taatgaagat gctataaagc aaatccctct ttctttttat ttctgctaaa 2160 caatatcaaa aattctgtgc acgaacacat ctctgcacaa gtttgggtcg ccattttcaa 2220 ttatcccata ttcattggag gaaatactaa gaatagagca tgc 2263 8 4336 DNA Arabidopsis thaliana 8 gtccaatgaa aaagcatcgt cggcataaac aaaaattccg gcgaaatcaa aggttagttc 60 aaagtatcaa agcgcgaaat catgaccaat tgtgaaaagg atgaggaatt cgtgtgcatc 120 agctgcgtag aagaggtacg gtactctttc gtgagccacc tctctgaagc tctccgtcga 180 aaaggcataa ataatgtggt cgtagatgta gatatcgatg atctgctttt caaggagtct 240 caggcaaaga tcgagaaagc tggggtttct gtgatggttt tacccggaaa ctgtgatcct 300 tccgaggtat ggcttgacaa gttcgccaag gttctcgagt gccaaaggaa caacaaggac 360 caggcggtgg tttcagtgtt gtacggtgac agtctattac gggaccaatg gcttagcgag 420 ctggatttca gaggcttatc acgaattcac caatccagga aggaatgtag tgactctata 480 cttgtagaag agattgtgag agatgtgtac gagacgcact tttatgttgg acgaattgga 540 atctattcga agctgctgga gattgaaaac atggttaaca agcaaccgat aggcatccgt 600 tgtgttggaa tttggggtat gcctggcata ggaaagacaa cacttgctaa agcagtcttt 660 gaccaaatgt ctagcgcctt tgatgcttct tgttttatcg aagactatga caaatcaatt 720 catgagaagg gtctttattg tttgctggag gaacaacttt tgccgggcaa tgatgcaacc 780 attatgaaac tgagctcgct cagagacaga ttgaacagta agagagttct tgttgttctc 840 gatgacgtgc gcaatgctct ggttggggag tcctttctcg aggggtttga ctggctagga 900 cccggaagcc tgatcatcat aacctctaga gataaacaag tgttttgcct ttgcggaatc 960 aatcaaatat atgaggtcca gggtttaaat gagaaagagg ctcgtcaact tttcttgctg 1020 tctgcgtcta taaaggagga tatgggagag cagaatctcc aggagttgtc agtgagagta 1080 ataaattatg ctaatggaaa cccgttagct atcagtgttt atggaagaga gctgaaaggt 1140 aagaaaaaac tctcagaaat ggagactgca ttcctcaaac tcaagcgacg tcctccattt 1200 aagattgtcg atgcatttaa aagcacctat gacacactca gtgacaacga aaagaacatt 1260 tttttggaca tagcttgttt cttccaggga gaaaatgtca actacgtgat acaactgctt 1320 gagggttgtg gtttctttcc acatgttgaa attgatgtcc ttgttgacaa gtgtctggta 1380 actatttcag aaaaccgagt ttggttgcat aagctgaccc aggatatcgg ccgagaaatc 1440 ataaatggag aaacagtaca gatcgagagg cgcagaagac tgtgggaacc ttggagcatc 1500 aaatatttat tagaatataa tgaacacaaa gcaaatggag aacctaaaac aaccttcaaa 1560 cgtgctcagg gctctgaaga gatcgaaggc ctgtttctag acacatcaaa cttaagattt 1620 gatctgcagc cctctgcctt taagaatatg ttgaacctta gattgctcaa aatttattgt 1680 tccaatcctg aagtccatcc tgtaatcaat ttcccaacag gctctctgca ttctcttcct 1740 aatgagctaa gactcctcca ttgggagaac tatcctctga aatctttgcc tcagaatttt 1800 gatccaaggc accttgtcga aatcaacatg ccgtatagtc aacttcagaa actttggggt 1860 ggaaccaaga acctggagat gttgaggacg atcaggcttt gccattccca ccatctagtt 1920 gatatcgatg atctcttaaa agctgaaaat cttgaggtaa ttgatctcca aggttgtacg 1980 agactgcaga atttcccagc cgcaggtcga ttgctacgtc tacgagttgt aaatctctca 2040 ggttgcataa agattaaaag tgtcctagaa attccaccaa atattgagaa actacatcta 2100 cagggaactg gcatattagc attaccagtt tccactgtta agccaaacca tagagagctt 2160 gtgaattttc taacagaaat tccgggtctt tcagaggaac ttgagcgttt aacaagtctg 2220 ctggaatcta actcatcttg tcaagatctt gggaagctta tttgcttgga gctgaaagat 2280 tgctcttgtt tgcagagtct gccaaacatg gctaatttag atcttaatgt tcttgatctc 2340 tcgggttgct caagtcttaa ttctattcag ggtttccctc gttttctgaa acagttatat 2400 cttggtggca ctgcaataag agaagtgcca caacttcctc aaagtctaga aatcttgaat 2460 gcacatggat cttgtttgcg aagtctgcca aacatggcta atttagaatt tctcaaagtt 2520 cttgatctct ctggttgctc agagctcgag actattcagg gttttcctcg gaacctaaaa 2580 gagttatatt ttgctggcac tacgttaaga gaagtgcccc aacttccttt aagcctagag 2640 gtcttgaatg cacatggttc tgactcggag aagcttccta tgcattacaa gttcaacaat 2700 tttttcgatc tatctcaaca agtggtcaac gattttttat tgaaaacgct gacttatgta 2760 aaacacatac caagagggta tacgcaggaa ctcatcaaca aagctccgac tttcagcttc 2820 agtgcgccct cacatacaaa tcaaaacgcc acatttgatc tgcaatcagg atcttctgta 2880 atgacacgac taaatcattc atggaggaac acgcttgtgg gatttggtat gctggtggaa 2940 gttgcatttc ccgaggacta ctgtgatgct acagatgttg gcataagttg tgtttgcaga 3000 tggagcaaca aagaaggccg ctcttgtagg atagaaagaa aatttcattg ttgggcacca 3060 tggcaagttg ttccaaaagt tcgaaaggat catacgtttg tctttagtga tgtcaacatg 3120 cgcccaagta ccggtgaagg aaatgaccct gatatctggg ctggattagt tgtatttgag 3180 ttctttccta tcaatcagca gacaaagtgt ctaaatgata ggttcacagt gagaagatgt 3240 ggagtccgtg taataaatgt tgcaactggc aatacaagtc ttgagaacat agcactagtt 3300 ttgtctttgg atccagtaga ggtttccggt tatgaagtat tgagagtcag ctatgatgat 3360 ttacaggaga tggataaagt tctatttctt tacatagcgt ctttgttcaa tgacgaggat 3420 gttgattttg tggcaccact tattgccggt attgacttgg atgttagctc tgggctcaag 3480 gtcttagccg atgtgtctct cataagtgta tcatcaaatg gggaaatagt gatgcatagt 3540 ttgcaaagac aaatgggtaa agaaatcctc catggacaat ccatgctgct gtctgattgt 3600 gagagttcca tgaccgagaa tttgtctgac gtaccaaaaa agaagaagaa acatagcgaa 3660 agtagggtaa agaaagtggt ttccataccg gctatagacg agggagatct atggacttgg 3720 cgaaagtacg gtcaaaaaga catcttaggt tctcgttttc caaggggtta ctacaggtgc 3780 gcttacaagt tcacgcatgg ttgtaaagct acaaaacaag tccaacggag cgagaccgat 3840 tcaaacatgt tagctattac ttacctatct gagcataacc atccacggcc cactaaacgc 3900 aaggctctcg ctgactccac tcgttccact tcctcctcca tctgctgagc cataactacc 3960 tctgcctcat ctagagtctt ccaaaacaaa gacgaaccaa atcaacccca cttgccttcc 4020 tcctccactc ctcctggaaa cgcggctgtc ttgtttaaaa tgacggacat ggaggagttt 4080 caggacaata tggaggtgga taatgacgtc gtagatacac gtacactggc attgtttcca 4140 gagtttcaac atcagccgga ggaagaatac ccatggtcaa cattcttcga tgattataat 4200 ttttgttttt attgagctca ctctcatcac tgcaagacaa gaaaaaataa ccagtgagat 4260 ccaaggattt atgtacaccc aaggatttag tttttgcagt gtcaatgatt ttcagccaat 4320 aataaatcat taattt 4336 9 1378 PRT Arabidopsis thaliana 9 Met Thr Asn Cys Glu Lys Asp Glu Glu Phe Val Cys Ile Ser Cys Val 1 5 10 15 Glu Glu Val Arg Tyr Ser Phe Val Ser His Leu Ser Glu Ala Leu Arg 20 25 30 Arg Lys Gly Ile Asn Asn Val Val Val Asp Val Asp Ile Asp Asp Leu 35 40 45 Leu Phe Lys Glu Ser Gln Ala Lys Ile Glu Lys Ala Gly Val Ser Val 50 55 60 Met Val Leu Pro Gly Asn Cys Asp Pro Ser Glu Val Trp Leu Asp Lys 65 70 75 80 Phe Ala Lys Val Leu Glu Cys Gln Arg Asn Asn Lys Asp Gln Ala Val 85 90 95 Val Ser Val Leu Tyr Gly Asp Ser Leu Leu Arg Asp Gln Trp Leu Ser 100 105 110 Glu Leu Asp Phe Arg Gly Leu Ser Arg Ile His Gln Ser Arg Lys Glu 115 120 125 Cys Ser Asp Ser Ile Leu Val Glu Glu Ile Val Arg Asp Val Tyr Glu 130 135 140 Thr His Phe Tyr Val Gly Arg Ile Gly Ile Tyr Ser Lys Leu Leu Glu 145 150 155 160 Ile Glu Asn Met Val Asn Lys Gln Pro Ile Gly Ile Arg Cys Val Gly 165 170 175 Ile Trp Gly Met Pro Gly Ile Gly Lys Thr Thr Leu Ala Lys Ala Val 180 185 190 Phe Asp Gln Met Ser Ser Ala Phe Asp Ala Ser Cys Phe Ile Glu Asp 195 200 205 Tyr Asp Lys Ser Ile His Glu Lys Gly Leu Tyr Cys Leu Leu Glu Glu 210 215 220 Gln Leu Leu Pro Gly Asn Asp Ala Thr Ile Met Lys Leu Ser Ser Leu 225 230 235 240 Arg Asp Arg Leu Asn Ser Lys Arg Val Leu Val Val Leu Asp Asp Val 245 250 255 Cys Asn Ala Leu Val Ala Glu Ser Phe Leu Glu Gly Phe Asp Trp Leu 260 265 270 Gly Pro Gly Ser Leu Ile Ile Ile Thr Ser Arg Asp Lys Gln Val Phe 275 280 285 Arg Leu Cys Gly Ile Asn Gln Ile Tyr Glu Val Gln Gly Leu Asn Glu 290 295 300 Lys Glu Ala Arg Gln Leu Phe Leu Leu Ser Ala Ser Ile Met Glu Asp 305 310 315 320 Met Gly Glu Gln Asn Leu His Glu Leu Ser Val Arg Val Ile Ser Tyr 325 330 335 Ala Asn Gly Asn Pro Leu Ala Ile Ser Val Tyr Gly Arg Glu Leu Lys 340 345 350 Gly Lys Lys Lys Leu Ser Glu Met Glu Thr Ala Phe Leu Lys Leu Lys 355 360 365 Arg Arg Pro Pro Phe Lys Ile Val Asp Ala Phe Lys Ser Ser Tyr Asp 370 375 380 Thr Leu Ser Asp Asn Glu Lys Asn Ile Phe Leu Asp Ile Ala Cys Phe 385 390 395 400 Phe Gln Gly Glu Asn Val Asn Tyr Val Ile Gln Leu Leu Glu Gly Cys 405 410 415 Gly Phe Phe Pro His Val Glu Ile Asp Val Leu Val Asp Lys Cys Leu 420 425 430 Val Thr Ile Ser Glu Asn Arg Val Trp Leu His Lys Leu Thr Gln Asp 435 440 445 Ile Gly Arg Glu Ile Ile Asn Gly Glu Thr Val Gln Ile Glu Arg Arg 450 455 460 Arg Arg Leu Trp Glu Pro Trp Ser Ile Lys Tyr Leu Leu Glu Tyr Asn 465 470 475 480 Glu His Lys Ala Asn Gly Glu Pro Lys Thr Thr Phe Lys Arg Ala Gln 485 490 495 Gly Ser Glu Glu Ile Glu Gly Leu Phe Leu Asp Thr Ser Asn Leu Arg 500 505 510 Phe Asp Leu Gln Pro Ser Ala Phe Lys Asn Met Leu Asn Leu Arg Leu 515 520 525 Leu Lys Ile Tyr Cys Ser Asn Pro Glu Val His Pro Val Ile Asn Phe 530 535 540 Pro Thr Gly Ser Leu His Ser Leu Pro Asn Glu Leu Arg Leu Leu His 545 550 555 560 Trp Glu Asn Tyr Pro Leu Lys Ser Leu Pro Gln Asn Phe Asp Pro Arg 565 570 575 His Leu Val Glu Ile Asn Met Pro Tyr Ser Gln Leu Gln Lys Leu Trp 580 585 590 Gly Gly Thr Lys Asn Leu Glu Met Leu Arg Thr Ile Arg Leu Cys His 595 600 605 Ser Gln His Leu Val Asp Ile Asp Asp Leu Leu Lys Ala Glu Asn Leu 610 615 620 Glu Val Ile Asp Leu Gln Gly Cys Thr Arg Leu Gln Asn Phe Pro Ala 625 630 635 640 Ala Gly Arg Leu Leu Arg Leu Arg Val Val Asn Leu Ser Gly Cys Ile 645 650 655 Lys Ile Lys Ser Val Leu Glu Ile Pro Pro Asn Ile Glu Lys Leu His 660 665 670 Leu Gln Gly Thr Gly Ile Leu Ala Leu Pro Val Ser Thr Val Lys Pro 675 680 685 Asn His Arg Glu Leu Val Asn Phe Leu Thr Glu Ile Pro Gly Leu Ser 690 695 700 Glu Ala Ser Lys Leu Glu Arg Leu Thr Ser Leu Leu Glu Ser Asn Ser 705 710 715 720 Ser Cys Gln Asp Leu Gly Lys Leu Ile Cys Leu Glu Leu Lys Asp Cys 725 730 735 Ser Cys Leu Gln Ser Leu Pro Asn Met Ala Asn Leu Asp Leu Asn Val 740 745 750 Leu Asp Leu Ser Gly Cys Ser Ser Leu Asn Ser Ile Gln Gly Phe Pro 755 760 765 Arg Phe Leu Lys Gln Leu Tyr Leu Gly Gly Thr Ala Ile Arg Glu Val 770 775 780 Pro Gln Leu Pro Gln Ser Leu Glu Ile Leu Asn Ala His Gly Ser Cys 785 790 795 800 Leu Arg Ser Leu Pro Asn Met Ala Asn Leu Glu Phe Leu Lys Val Leu 805 810 815 Asp Leu Ser Gly Cys Ser Glu Leu Glu Thr Ile Gln Gly Phe Pro Arg 820 825 830 Asn Leu Lys Glu Leu Tyr Phe Ala Gly Thr Thr Leu Arg Glu Val Pro 835 840 845 Gln Leu Pro Leu Ser Leu Glu Val Leu Asn Ala His Gly Ser Asp Ser 850 855 860 Glu Lys Leu Pro Met His Tyr Lys Phe Asn Asn Phe Phe Asp Leu Ser 865 870 875 880 Gln Gln Val Val Asn Asp Phe Phe Leu Lys Ala Leu Thr Tyr Val Lys 885 890 895 His Ile Pro Arg Gly Tyr Thr Gln Glu Leu Ile Asn Lys Ala Pro Thr 900 905 910 Phe Ser Phe Ser Ala Pro Ser His Thr Asn Gln Asn Ala Thr Phe Asp 915 920 925 Leu Gln Pro Gly Ser Ser Val Met Thr Arg Leu Asn His Ser Trp Arg 930 935 940 Asn Thr Leu Val Gly Phe Gly Met Leu Val Glu Val Ala Phe Pro Glu 945 950 955 960 Asp Tyr Cys Asp Ala Thr Asp Val Gly Ile Ser Cys Val Cys Arg Trp 965 970 975 Ser Asn Lys Glu Gly Arg Ser Cys Arg Ile Glu Arg Asn Phe His Cys 980 985 990 Trp Ala Pro Gly Lys Val Val Pro Lys Val Arg Lys Asp His Thr Phe 995 1000 1005 Val Phe Ser Asp Val Asn Met Arg Pro Ser Thr Gly Glu Gly Asn Asp 1010 1015 1020 Pro Asp Ile Trp Ala Gly Leu Val Val Phe Glu Phe Phe Pro Ile Asn 1025 1030 1035 1040 Gln Gln Thr Lys Cys Leu Asn Asp Arg Phe Thr Val Thr Arg Cys Gly 1045 1050 1055 Val Arg Val Ile Asn Val Ala Thr Gly Asn Thr Ser Leu Glu Asn Ile 1060 1065 1070 Ser Leu Val Leu Ser Leu Asp Pro Val Glu Val Ser Gly Tyr Glu Val 1075 1080 1085 Leu Arg Val Ser Tyr Asp Asp Leu Gln Glu Met Asp Lys Val Leu Phe 1090 1095 1100 Leu Tyr Ile Ala Ser Leu Phe Asn Asp Glu Asp Val Asp Phe Val Ala 1105 1110 1115 1120 Pro Leu Ile Ala Gly Ile Asp Leu Asp Val Ser Ser Gly Leu Lys Val 1125 1130 1135 Leu Ala Asp Val Ser Leu Ile Ser Val Ser Ser Asn Gly Glu Ile Val 1140 1145 1150 Met His Ser Leu Gln Arg Gln Met Gly Lys Glu Ile Leu His Gly Gln 1155 1160 1165 Ser Met Leu Leu Ser Asp Cys Glu Ser Ser Met Thr Glu Asn Leu Ser 1170 1175 1180 Asp Val Pro Lys Lys Glu Lys Lys His Arg Glu Ser Lys Val Lys Lys 1185 1190 1195 1200 Val Val Ser Ile Pro Ala Ile Asp Glu Gly Asp Leu Trp Thr Trp Arg 1205 1210 1215 Lys Tyr Gly Gln Lys Asp Ile Leu Gly Ser Arg Phe Pro Arg Gly Tyr 1220 1225 1230 Tyr Arg Cys Ala Tyr Lys Phe Thr His Gly Cys Lys Ala Thr Lys Gln 1235 1240 1245 Val Gln Arg Ser Glu Thr Asp Ser Asn Met Leu Ala Ile Thr Tyr Leu 1250 1255 1260 Ser Glu His Asn His Pro Arg Pro Thr Lys Arg Lys Ala Leu Ala Asp 1265 1270 1275 1280 Ser Thr Arg Ser Thr Ser Ser Ser Ile Cys Ser Ala Ile Thr Thr Ser 1285 1290 1295 Ala Ser Ser Arg Val Phe Gln Asn Lys Asp Glu Pro Asn Gln Pro His 1300 1305 1310 Leu Pro Ser Ser Ser Thr Pro Pro Arg Asn Ala Ala Val Leu Phe Lys 1315 1320 1325 Met Thr Asp Met Glu Glu Phe Gln Asp Asn Met Glu Val Asp Asn Asp 1330 1335 1340 Val Val Asp Thr Arg Thr Leu Ala Leu Phe Pro Glu Phe Gln His Gln 1345 1350 1355 1360 Pro Glu Glu Glu Asp Pro Trp Ser Thr Phe Phe Asp Asp Tyr Asn Phe 1365 1370 1375 Tyr Phe 10 1288 PRT Arabidopsis thaliana 10 Met Thr Asn Cys Glu Lys Asp Glu Glu Phe Val Cys Ile Ser Cys Val 1 5 10 15 Glu Glu Val Arg Tyr Ser Phe Val Ser His Leu Ser Glu Ala Leu Arg 20 25 30 Arg Lys Gly Ile Asn Asn Val Val Val Asp Val Asp Ile Asp Asp Leu 35 40 45 Leu Phe Lys Glu Ser Gln Ala Lys Ile Glu Lys Ala Gly Val Ser Val 50 55 60 Met Val Leu Pro Gly Asn Cys Asp Pro Ser Glu Val Trp Leu Asp Lys 65 70 75 80 Phe Ala Lys Val Leu Glu Cys Gln Arg Asn Asn Lys Asp Gln Ala Val 85 90 95 Val Ser Val Leu Tyr Gly Asp Ser Leu Leu Arg Asp Gln Trp Leu Ser 100 105 110 Glu Leu Asp Phe Arg Gly Leu Ser Arg Ile His Gln Ser Arg Lys Glu 115 120 125 Cys Ser Asp Ser Ile Leu Val Glu Glu Ile Val Arg Asp Val Tyr Glu 130 135 140 Thr His Phe Tyr Val Gly Arg Ile Gly Ile Tyr Ser Lys Leu Leu Glu 145 150 155 160 Ile Glu Asn Met Val Asn Lys Gln Pro Ile Gly Ile Arg Cys Val Gly 165 170 175 Ile Trp Gly Met Pro Gly Ile Gly Lys Thr Thr Leu Ala Lys Ala Val 180 185 190 Phe Asp Gln Met Ser Ser Ala Phe Asp Ala Ser Cys Phe Ile Glu Asp 195 200 205 Tyr Asp Lys Ser Ile His Glu Lys Gly Leu Tyr Cys Leu Leu Glu Glu 210 215 220 Gln Leu Leu Pro Gly Asn Asp Ala Thr Ile Met Lys Leu Ser Ser Leu 225 230 235 240 Arg Asp Arg Leu Asn Ser Lys Arg Val Leu Val Val Leu Asp Asp Val 245 250 255 Arg Asn Ala Leu Val Gly Glu Ser Phe Leu Glu Gly Phe Asp Trp Leu 260 265 270 Gly Pro Gly Ser Leu Ile Ile Ile Thr Ser Arg Asp Lys Gln Val Phe 275 280 285 Cys Leu Cys Gly Ile Asn Gln Ile Tyr Glu Val Gln Gly Leu Asn Glu 290 295 300 Lys Glu Ala Arg Gln Leu Phe Leu Leu Ser Ala Ser Ile Lys Glu Asp 305 310 315 320 Met Gly Glu Gln Asn Leu Gln Glu Leu Ser Val Arg Val Ile Asn Tyr 325 330 335 Ala Asn Gly Asn Pro Leu Ala Ile Ser Val Tyr Gly Arg Glu Leu Lys 340 345 350 Gly Lys Lys Lys Leu Ser Glu Met Glu Thr Ala Phe Leu Lys Leu Lys 355 360 365 Arg Arg Pro Pro Phe Lys Ile Val Asp Ala Phe Lys Ser Thr Tyr Asp 370 375 380 Thr Leu Ser Asp Asn Glu Lys Asn Ile Phe Leu Asp Ile Ala Cys Phe 385 390 395 400 Phe Gln Gly Glu Asn Val Asn Tyr Val Ile Gln Leu Leu Glu Gly Cys 405 410 415 Gly Phe Phe Pro His Val Glu Ile Asp Val Leu Val Asp Lys Cys Leu 420 425 430 Val Thr Ile Ser Glu Asn Arg Val Trp Leu His Lys Leu Thr Gln Asp 435 440 445 Ile Gly Arg Glu Ile Ile Asn Gly Glu Thr Val Gln Ile Glu Arg Arg 450 455 460 Arg Arg Leu Trp Glu Pro Trp Ser Ile Lys Tyr Leu Leu Glu Tyr Asn 465 470 475 480 Glu His Lys Ala Asn Gly Glu Pro Lys Thr Thr Phe Lys Arg Ala Gln 485 490 495 Gly Ser Glu Glu Ile Glu Gly Leu Phe Leu Asp Thr Ser Asn Leu Arg 500 505 510 Phe Asp Leu Gln Pro Ser Ala Phe Lys Asn Met Leu Asn Leu Arg Leu 515 520 525 Leu Lys Ile Tyr Cys Ser Asn Pro Glu Val His Pro Val Ile Asn Phe 530 535 540 Pro Thr Gly Ser Leu His Ser Leu Pro Asn Glu Leu Arg Leu Leu His 545 550 555 560 Trp Glu Asn Tyr Pro Leu Lys Ser Leu Pro Gln Asn Phe Asp Pro Arg 565 570 575 His Leu Val Glu Ile Asn Met Pro Tyr Ser Gln Leu Gln Lys Leu Trp 580 585 590 Gly Gly Thr Lys Asn Leu Glu Met Leu Arg Thr Ile Arg Leu Cys His 595 600 605 Ser His His Leu Val Asp Ile Asp Asp Leu Leu Lys Ala Glu Asn Leu 610 615 620 Glu Val Ile Asp Leu Gln Gly Cys Thr Arg Leu Gln Asn Phe Pro Ala 625 630 635 640 Ala Gly Arg Leu Leu Arg Leu Arg Val Val Asn Leu Ser Gly Cys Ile 645 650 655 Lys Ile Lys Ser Val Leu Glu Ile Pro Pro Asn Ile Glu Lys Leu His 660 665 670 Leu Gln Gly Thr Gly Ile Leu Ala Leu Pro Val Ser Thr Val Lys Pro 675 680 685 Asn His Arg Glu Leu Val Asn Phe Leu Thr Glu Ile Pro Gly Leu Ser 690 695 700 Glu Glu Leu Glu Arg Leu Thr Ser Leu Leu Glu Ser Asn Ser Ser Cys 705 710 715 720 Gln Asp Leu Gly Lys Leu Ile Cys Leu Glu Leu Lys Asp Cys Ser Cys 725 730 735 Leu Gln Ser Leu Pro Asn Met Ala Asn Leu Asp Leu Asn Val Leu Asp 740 745 750 Leu Ser Gly Cys Ser Ser Leu Asn Ser Ile Gln Gly Phe Pro Arg Phe 755 760 765 Leu Lys Gln Leu Tyr Leu Gly Gly Thr Ala Ile Arg Glu Val Pro Gln 770 775 780 Leu Pro Gln Ser Leu Glu Ile Leu Asn Ala His Gly Ser Cys Leu Arg 785 790 795 800 Ser Leu Pro Asn Met Ala Asn Leu Glu Phe Leu Lys Val Leu Asp Leu 805 810 815 Ser Gly Cys Ser Glu Leu Glu Thr Ile Gln Gly Phe Pro Arg Asn Leu 820 825 830 Lys Glu Leu Tyr Phe Ala Gly Thr Thr Leu Arg Glu Val Pro Gln Leu 835 840 845 Pro Leu Ser Leu Glu Val Leu Asn Ala His Gly Ser Asp Ser Glu Lys 850 855 860 Leu Pro Met His Tyr Lys Phe Asn Asn Phe Phe Asp Leu Ser Gln Gln 865 870 875 880 Val Val Asn Asp Phe Leu Leu Lys Thr Leu Thr Tyr Val Lys His Ile 885 890 895 Pro Arg Gly Tyr Thr Gln Glu Leu Ile Asn Lys Ala Pro Thr Phe Ser 900 905 910 Phe Ser Ala Pro Ser His Thr Asn Gln Asn Ala Thr Phe Asp Leu Gln 915 920 925 Ser Gly Ser Ser Val Met Thr Arg Leu Asn His Ser Trp Arg Asn Thr 930 935 940 Leu Val Gly Phe Gly Met Leu Val Glu Val Ala Phe Pro Glu Asp Tyr 945 950 955 960 Cys Asp Ala Thr Asp Val Gly Ile Ser Cys Val Cys Arg Trp Ser Asn 965 970 975 Lys Glu Gly Arg Ser Cys Arg Ile Glu Arg Lys Phe His Cys Trp Ala 980 985 990 Pro Trp Gln Val Val Pro Lys Val Arg Lys Asp His Thr Phe Val Phe 995 1000 1005 Ser Asp Val Asn Met Arg Pro Ser Thr Gly Glu Gly Asn Asp Pro Asp 1010 1015 1020 Ile Trp Ala Gly Leu Val Val Phe Glu Phe Phe Pro Ile Asn Gln Gln 1025 1030 1035 1040 Thr Lys Cys Leu Asn Asp Arg Phe Thr Val Arg Arg Cys Gly Val Arg 1045 1050 1055 Val Ile Asn Val Ala Thr Gly Asn Thr Ser Leu Glu Asn Ile Ala Leu 1060 1065 1070 Val Leu Ser Leu Asp Pro Val Glu Val Ser Gly Tyr Glu Val Leu Arg 1075 1080 1085 Val Ser Tyr Asp Asp Leu Gln Glu Met Asp Lys Val Leu Phe Leu Tyr 1090 1095 1100 Ile Ala Ser Leu Phe Asn Asp Glu Asp Val Asp Phe Val Ala Pro Leu 1105 1110 1115 1120 Ile Ala Gly Ile Asp Leu Asp Val Ser Ser Gly Leu Lys Val Leu Ala 1125 1130 1135 Asp Val Ser Leu Ile Ser Val Ser Ser Asn Gly Glu Ile Val Met His 1140 1145 1150 Ser Leu Gln Arg Gln Met Gly Lys Glu Ile Leu His Gly Gln Ser Met 1155 1160 1165 Leu Leu Ser Asp Cys Glu Ser Ser Met Thr Glu Asn Leu Ser Asp Val 1170 1175 1180 Pro Lys Lys Lys Lys Lys His Ser Glu Ser Arg Val Lys Lys Val Val 1185 1190 1195 1200 Ser Ile Pro Ala Ile Asp Glu Gly Asp Leu Trp Thr Trp Arg Lys Tyr 1205 1210 1215 Gly Gln Lys Asp Ile Leu Gly Ser Arg Phe Pro Arg Gly Tyr Tyr Arg 1220 1225 1230 Cys Ala Tyr Lys Phe Thr His Gly Cys Lys Ala Thr Lys Gln Val Gln 1235 1240 1245 Arg Ser Glu Thr Asp Ser Asn Met Leu Ala Ile Thr Tyr Leu Ser Glu 1250 1255 1260 His Asn His Pro Arg Pro Thr Lys Arg Lys Ala Leu Ala Asp Ser Thr 1265 1270 1275 1280 Arg Ser Thr Ser Ser Ser Ile Cys 1285 11 18 DNA Homo sapiens 11 agctcgagac tattcagg 18 12 19 DNA Homo sapiens 12 aaacactgat agctaacgg 19 13 18 DNA Homo sapiens 13 atctctaacg gtggatgg 18 14 19 DNA Homo sapiens 14 tgcattcaag acctctagg 19 15 19 DNA Homo sapiens 15 ccgttagcta tcagtgttt 19 16 18 DNA Homo sapiens 16 agtcatcaag tgaccatc 18 17 18 DNA Homo sapiens 17 ctagaggtct tgaatgca 18 18 18 DNA Homo sapiens 18 gcatcacagt agtcctcg 18 19 19 DNA Homo sapiens 19 acatccaagt caataccgg 19 20 18 DNA Homo sapiens 20 gccaatagag atgtacca 18 21 18 DNA Homo sapiens 21 tggtacatct ctaatggc 18 22 19 DNA Homo sapiens 22 agtaacacgt aatgtaacc 19 23 19 DNA Homo sapiens 23 accagcaagt ttaggatga 19 24 18 DNA Homo sapiens 24 gatggtcact tgatgact 18 25 19 DNA Homo sapiens 25 ggtgtacata aatccttgg 19 26 19 DNA Homo sapiens 26 ccaaggattt atgtacacc 19 27 18 DNA Homo sapiens 27 actcttatgg agatgctc 18 28 18 DNA Homo sapiens 28 cgcatcctta aactactg 18 29 18 DNA Homo sapiens 29 atatctccgg tttcaacc 18 30 18 DNA Homo sapiens 30 ccttggtgag tagctcac 18 31 18 DNA Homo sapiens 31 ccatagatct ccctcgtc 18 32 19 DNA Homo sapiens 32 ccttatagaa cttctctcc 19 33 18 DNA Homo sapiens 33 ctcttcgagt gcatcagg 18 34 19 DNA Homo sapiens 34 ccggtattga cttggatgt 19 35 18 DNA Homo sapiens 35 agatacacgt acactggc 18 36 18 DNA Homo sapiens 36 tccagcccag atatcagg 18 37 18 DNA Homo sapiens 37 tgcataggaa gcttctcc 18 38 18 DNA Homo sapiens 38 ttcagaggaa cttgagcg 18 39 19 DNA Homo sapiens 39 ccaagcaaat aagcttccc 19 40 18 DNA Homo sapiens 40 atcgtcctca acatctcc 18 41 18 DNA Homo sapiens 41 aggcgcagaa gactgtgg 18 42 18 DNA Homo sapiens 42 ttgatgctcc aaggttcc 18 43 18 DNA Homo sapiens 43 gagatgtgta cgagacgc 18 44 18 DNA Homo sapiens 44 caatctccag cagcttcg 18 45 18 DNA Homo sapiens 45 ttgagtggtt gaatgtcc 18 46 18 DNA Homo sapiens 46 cacacgaatt cctcatcc 18 47 18 DNA Homo sapiens 47 tgaaggaaca ctcgttgc 18 48 18 DNA Homo sapiens 48 gtctttcaga ggcctcga 18 49 18 DNA Homo sapiens 49 ggtaagcaat ctctgata 18 50 18 DNA Homo sapiens 50 atgttatatc gacgttgg 18 51 18 DNA Homo sapiens 51 gaggaagtgg aacgagtg 18 52 18 DNA Homo sapiens 52 aactcctcca tgtccgtc 18 53 26 DNA Homo sapiens 53 atctccctcg tctatagccg gtatgg 26 54 27 DNA Homo sapiens 54 gatcaggctt ccgggtccta gccagtc 27 55 27 DNA Homo sapiens 55 agtgatgtca acatgcgccc aagtacc 27 56 26 DNA Homo sapiens 56 aaccttcaaa cgtgctcagg gctctg 26 57 26 DNA Homo sapiens 57 acatctccag gttcttggtt ccaccc 26 58 29 DNA Homo sapiens 58 gtcgacatga ccaattgtga aaaggatga 29 59 27 DNA Homo sapiens 59 gtcgaccttg tcttgcagtg atgagag 27 60 23 DNA Homo sapiens 60 ctattccatg gaggaggaag tgg 23 61 24 DNA Homo sapiens 61 ttagtcgacg aagaagaaac atag 24 

1. Nucleic acid containing at least 15 consecutive nucleotides of a nucleotide sequence coding for a protein of resistance of a plant to a pathogen, said protein comprising: a) an N-terminal portion containing at least one amino acid sequence rich in leucine and at least one nucleotide-binding site; and b) a C-terminal portion containing a DNA-binding domain, said binding domain comprising the amino acid sequence “WRKYGQK”, as well as a nucleic acid of complementary sequence.
 2. Nucleic acid according to claim 1, characterized in that it has at least 40%, advantageously 60%, preferably 80% and more preferably 90% identity in nucleotides with the nucleotide sequence SEQ ID N^(o) 1, as well as a nucleic acid of complementary sequence.
 3. Nucleic acid according to claim 2, characterized in that it has at least 40%, advantageously 60%, preferably 80% and more preferably 90% identity in nucleotides with a nucleotide sequence selected from among the following sequences: a) the nucleotide sequence between the nucleotide in position 260 and the nucleotide in position 636 of the nucleic acid SEQ ID N^(o) 1; b) the nucleotide sequence between the nucleotide in position 746 and the nucleotide in position 1856 of the nucleic acid SEQ ID N^(o) 1; c) the nucleotide sequence between the nucleotide in position 1937 and the nucleotide in position 2236 of the nucleic acid SEQ ID N^(o) 1; d) the nucleotide sequence between the nucleotide in position 2326 and the nucleotide in position 3249 of the nucleic acid SEQ ID N^(o) 1; e) the nucleotide sequence between the nucleotide in position 3438 and the nucleotide in position 4291 of the nucleic acid SEQ ID N^(o) 1; f) the nucleotide sequence between the nucleotide in position 5377 and the nucleotide in position 5499 of the nucleic acid SEQ ID N^(o) 1; g) the nucleotide sequence between the nucleotide in position 6085 and the nucleotide in position 6532 of the nucleic acid SEQ ID N^(o) 1; as well as a nucleic acid of complementary sequence.
 4. Nucleic acid according to claim 2, characterized in that it has at least 40%, advantageously 60%, preferably 80% and more preferably 90% identity in nucleotides with a nucleotide sequence selected from among the following sequences: a) the nucleotide sequence between the nucleotide in position 637 and the nucleotide in position 745 of the nucleic acid SEQ ID N^(o) 1; b) the nucleotide sequence between the nucleotide in position 1857 and the nucleotide in position 1936 of the nucleic acid SEQ ID N^(o) 1; c) the nucleotide sequence between the nucleotide in position 2237 and the nucleotide in position 2325 of the nucleic acid SEQ ID N^(o) 1; d) the nucleotide sequence between the nucleotide in position 3250 and the nucleotide in position 3437 of the nucleic acid SEQ ID N^(o) 1; e) the nucleotide sequence between the nucleotide in position 4292 and the nucleotide in position 5376 of the nucleic acid SEQ ID N^(o) 1; f) the nucleotide sequence between the nucleotide in position 5500 and the nucleotide in position 6084 of the nucleic acid SEQ ID N^(o) 1; as well as a nucleic acid of complementary sequence.
 5. Nucleic acid according to claim 1, characterized in that it contains at least 15 consecutive nucleotides of the nucleotide sequence SEQ ID N^(o) 4, as well as a nucleic acid of complementary sequence.
 6. Nucleic acid according to claim 1, characterized in that it has at least 40%, advantageously 60%, preferably 80% and more preferably 90% identity in nucleotides with the nucleotide sequence SEQ ID N^(o) 5, as well as a nucleic acid of complementary sequence.
 7. Nucleic acid according to claim 6, characterized in that it has at least 40%, advantageously 60%, preferably 80% and more preferably 90% identity in nucleotides with a nucleotide sequence selected from among the following sequences: a) the nucleotide sequence between the nucleotide in position 1184 and the nucleotide in position 1560 of the nucleic acid SEQ ID N^(o) 5; b) the nucleotide sequence between the nucleotide in position 1670 30 and the nucleotide in position 278 of the nucleic acid SEQ ID N^(o) 5; c) the nucleotide sequence between the nucleotide in position 2861 and the nucleotide in position 3160 of the nucleic acid SEQ ID N^(o) 5; d) the nucleotide sequence between the nucleotide in position 3254 and the nucleotide in position 4171 of the nucleic acid SEQ ID N^(o) 5 e) the nucleotide sequence between the nucleotide in position 4360 and the nucleotide in position 5213 of the nucleic acid SEQ ID N^(o) 5; f) the nucleotide sequence between the nucleotide in position 6302 and the nucleotide in position 6424 of the nucleic acid SEQ ID N^(o) 5; g) the nucleotide sequence between the nucleotide in position 6953 and the nucleotide in position 7136 of the nucleic acid SEQ ID N^(o) 5; as well as a nucleic acid of complementary sequence.
 8. Nucleic acid according to claim 6, characterized in that it has at least 40%, advantageously 60%, preferably 80% and more preferably 90% identity in nucleotides with a nucleotide sequence selected from among the following sequences: a) the nucleotide sequence between the nucleotide in position 1561 and the nucleotide in position 1669 of the nucleic acid SEQ ID N^(o) 5; b) the nucleotide sequence between the nucleotide in position 2781 and the nucleotide in position 2860 of the nucleic acid SEQ ID N^(o) 5; c) the nucleotide sequence between the nucleotide in position 3161 and the nucleotide in position 3253 of the nucleic acid SEQ ID N^(o) 5; d) the nucleotide sequence between the nucleotide in position 4172 and the nucleotide in position 4359 of the nucleic acid SEQ ID N^(o) 5; e) the nucleotide sequence between the nucleotide in position 5214 and the nucleotide in position 6301 of the nucleic acid SEQ ID N^(o) 5; f) the nucleotide sequence between the nucleotide in position 6425 and the nucleotide in position 6592 of the nucleic acid SEQ ID N^(o) 5; as well as a nucleic acid of complementary sequence.
 9. Nucleic acid according to claim 1, characterized in that it contains at least 15 consecutive nucleotides of the nucleotide sequence SEQ ID N^(o) 8, as well as a nucleic acid of complementary sequence.
 10. Hybridizing nucleic acid, under very strict hybridization conditions, with a nucleic acid according to any one of claims 1 to
 9. 11. Hybridizing nucleotide probe, under very strict hybridization conditions, with a nucleic acid according to any one of claims 1 to
 10. 12. Hybridizing nucleotide primer, under very strict hybridization conditions, with a nucleic acid according to any one of claims 1 to
 10. 13. Nucleic acid according to claim 10, characterized in that it is selected from among the polynucleotides of sequences SEQ ID N^(o) 11 to
 61. 14. Antisense nucleotide sequence containing at least 15 consecutive nucleotides of a nucleic acid according to any one of claims 1 to
 10. 15. Recombinant vector, characterized in that it contains a nucleic acid according to any one of claims 1 to
 10. 16. Recombinant vector according to claim 15, characterized in that it is a vector of functional expression in a plant host cell.
 17. Recombinant vector according to one of claims 15 and 16, characterized in that it is a vector of fungal, bacterial or viral origin.
 18. Host cell transformed with a nucleic acid according to one of claims 1 to 10 or with a recombinant vector according to one of claims 15 to
 17. 19. Host cell transformed according to claim 18, characterized in that it is a cell of prokaryotic or eukaryotic origin.
 20. Host cell transformed according to claim 19, characterized in that it is a cell of Agrobacterium tumefaciens.
 21. Host cell transformed according to claim 19, characterized in that it is a plant cell of Arabidopsis thaliana, of rape, of corn or of tobacco.
 22. Recombinant multicellular plant organism, characterized in that it contains at least one host cell transformed according to one of claims 18 to
 21. 23. Plant transformed with a nucleic acid according to one of claims 1 to 10, a nucleotide sequence according to claim 13 or a recombinant vector according to one of claims 15 to
 17. 24. Transformed plant containing, in a form integrated into its genome, a nucleic acid according to one of claims 1 to 10, a nucleotide sequence according to claim 13 or a recombinant vector according to one of claims 15 to
 17. 25. Method for obtaining a transformed plant, characterized in that it comprises the following steps a) obtaining a transformed plant host cell according to one of claims 19 to 21; b) regenerating an entire plant from the recombinant host cell obtained from step a); c) selecting the plants obtained from step b) which have integrated a polynucleotide of interest selected from among the nucleic acids according to one of claims 1 to
 10. 26. Method for obtaining a transformed plant, characterized in that it comprises the following steps: a) obtaining a host cell of Agrobacterium tumefaciens according to claim 20; b) transforming the plant by infection with the cells of Agrobacterium tumefaciens obtained from step a); c) selecting the plants which have integrated a polynucleotide of interest selected from among the nucleic acids according to one of claims 1 to
 10. 27. Method for obtaining a transformed plant, characterized in that it comprises the following steps: a) transfecting a plant cell with a nucleic acid according to one of claims 1 to 10 or a recombinant vector according to one of claims 15 to 17; b) regenerating an entire plant from the recombinant plant cells obtained from step a); c) selecting the plants which have integrated the nucleic acid according to one of claims 1 to 10 or the recombinant vector according to one of claims 15 to
 17. 28. Method for obtaining a transformed plant according to one of claims 25 to 27, characterized in that it additionally comprises the steps of: d) crossing between themselves two transformed plants such as those obtained from step c); e) selecting the plants homozygous for the nucleic acid of interest.
 29. Method for obtaining a transformed plant according to one of claims 25 to 27, characterized in that it additionally comprises the steps of: d) crossing a transformed plant obtained from step c) with a plant of the same species; e) selecting the plants derived from the crossing of step d) which have retained the nucleic acid of interest.
 30. Transformed plant such as that obtained according to the method according to any one of claims 25 to
 29. 31. Seed of a transformed plant according to any one of claims 23, 24 and
 29. 32. Seed of a plant whose component cells contain in their genome a nucleic acid according to one of claims 1 to
 10. 33. Use of a nucleic acid according to one of claims 1 to 10 for the in vitro or in vivo expression of a protein selected from among the RRS1-S or RRS1-R proteins or of a peptide fraction of these.
 34. Use according to claim 33, characterized in that it is an in vivo expression in a plant transformed with such a nucleic acid.
 35. Use of a nucleotide sequence according to claim 14, or of a recombinant vector containing a nucleotide sequence according to claim 14, to inhibit or block the expression of the gene coding for the RRS1-S protein or for the RRS1-R protein.
 36. Method for detecting a component nucleic acid of the RRS1-S or RRS1-R gene in a sample, comprising the steps of: a) placing a probe or a number of probes according to claim 11 in contact with the nucleic acid which may be contained in the sample; b) detecting any hybrid formed between the nucleic acid of the sample and the probe or probes.
 37. Kit or pack for detecting a component nucleic acid of the RRS1-S or RRS1-R gene in a sample, comprising: a) a probe or a number of probes according to claim 11; b) optionally, the reagents necessary for the hybridization reaction.
 38. Method for amplifying a component nucleic acid of the RRS1-S or RRS1-R gene in a sample, comprising the steps of: a) Placing a pair of primers according to one of claims 12 and 13 in contact with the nucleic acid which may be contained in the sample; b) performing at least one amplification cycle of the nucleic acid contained in the sample; c) detecting any nucleic acid which has been amplified.
 39. Kit or pack for amplifying a nucleic acid in a sample, comprising: a) a pair of primers according to one of claims 12 and 13; b) optionally, the reagents necessary for performing the amplification reaction.
 40. Polypeptide coded by a nucleic acid according to any one of claims 1 to
 10. 41. Polypeptide according to claim 40, characterized in that it comprises an amino acid sequence SEQ ID N^(o) 9 or a polypeptide having at least 40% identity in amino acids with the sequence SEQ ID N^(o)
 9. 42. Polypeptide according to claim 40, characterized in that it comprises an amino acid sequence SEQ ID N^(o) 10 or a polypeptide having at least 40% identity in amino acids with the sequence SEQ ID N^(o)
 10. 43. Polypeptide containing amino acid modifications of 1, 2, 3, 4, 5, 10 to 20 substitutions, additions or deletions of an amino acid compared to the amino acid sequence of a polypeptide according to one of claims 41 and
 42. 44. Polypeptide containing at least 5 consecutive amino acids of a polypeptide according to one of claims 40 to
 43. 45. Fusion polypeptide, characterized in that it comprises the N-terminal portion of the RRS1-S polypeptide fused with the C-terminal portion of the RRS1-R polypeptide.
 46. Fusion polypeptide, characterized in that it comprises the N-terminal portion of the RRS1-R polypeptide fused with the C-terminal portion of the RRS1-S polypeptide.
 47. Nucleic acid coding for a polypeptide according to one of claims 45 and
 46. 48. Antibody directed against a polypeptide according to one of claims 40 to
 46. 49. Method for detecting the presence of a polypeptide according to one of claims 40 to 44 in a sample, comprising the steps of: a) placing the sample in contact with an antibody according to claim 48; b) detecting any antigen/antibody complex formed.
 50. Diagnostic kit or pack for detecting the presence of a polypeptide according to one of claims 40 to 44 in a sample, characterized in that it comprises: a) an antibody according to claim 48; b) optionally, the reagents necessary for detecting the antigen/antibody complexes formed.
 51. Method for screening a candidate substance fixing to a RRS1-S or RRS1-R polypeptide, characterized in that it comprises the steps of: a) preparing a polypeptide according to one of claims 40 to 44; b) obtaining a candidate substance to be tested; c) placing the polypeptide from step a) in contact with the candidate substance from step b); d) detecting any complex formed between the polypeptide and the candidate substance.
 52. Kit or pack for screening a candidate substance fixing to a RRS1-S or RRS1-R polypeptide, comprising: a) a polypeptide according to one of claims 40 to 44; b) optionally, the reagents necessary for detecting the complexes formed between the polypeptide and a candidate substance to be tested.
 53. Method for screening a candidate substance fixing to a RRS1-S or RRS1-R polypeptide, characterized in that it comprises the steps of: a) obtaining a first nucleic acid coding for a fusion protein comprising a part of the polypeptide of interest fused to the DNA-binding domain of a transcription factor such as Gal4; b) obtaining a second nucleic acid coding for a fusion protein comprising the candidate substance fused to the transcription domain of a transcription factor such as Gal4; c) producing a nucleic acid containing a nucleotide sequence coding for a detectable marker, placed under the control of a regulatory sequence recognized by a transcription factor such as Gal4; the nucleic acids a), b) and c) being inserted into appropriate vectors; and d) co-transfecting yeast cells simultaneously with said vectors; e) detecting the expression of the nucleotide sequence coding for the detectable marker.
 54. Kit or pack for screening a candidate substance fixing to the RRS1-S or RRS1-R polypeptide, comprising: a) a first nucleic acid coding for a fusion protein comprising a part of the polypeptide of interest fused to the DNA-binding domain of a transcription factor such as Gal4; b) optionally, a second nucleic acid containing a nucleotide sequence coding for a detectable marker, placed under the control of a regulatory sequence recognized by the transcription factor such as Gal4; c) a third nucleic acid coding for a fusion protein comprising the candidate substance fused to the transcription domain of the transcription factor such as Gal4;
 55. Substance able to fix to the RRS1-S or RRS1-R polypeptide, characterized in that it may be obtained by a method according to one of claims 51 and
 53. 56. Method for screening a nucleic acid interacting with a polypeptide according to one of claims 40 to 44, comprising the steps of: a) obtaining a statistical population of nucleic acids of 20 to 50 nucleotides in length; b) placing the population of nucleic acids from step a) in contact with a polypeptide according to one of claims 40 to 44; c) characterizing the nucleic acid or acids interacting with said polypeptide.
 57. Kit or pack for screening a nucleic acid interacting with a polypeptide according to one of claims 40 to 44, comprising: a) a polypeptide according to one of claims 40 to 44; b) optionally, a statistical population of nucleic acids of 20 to 50 nucleotides in length. 